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MODIFIED PLASMID AND USE THEREOF 



FIELD OF THE INVENTION 

5 The present application relates to modified plasmids and uses thereof. 
BACKGROUND OF THE INVENTION 

Certain closely related species of budding yeast have been shown to contain 

10 naturally occurring circular double stranded DNA plasmids. These plasmids, 
collectively termed 2 jxm- family plasmids, include pSRl, pSB3 and pSB4 from 
Zygosaccharomyces rowrii (formerly classified as Zygosaccharomyces bisporus), 
plasmids pSBl and pSB2 from Zygosaccharomyces bailii, plasmid pSMl from 
Zygosaccharomyces fermmtati, plasmid pKDl from Kluyveromyces 

15 drosphilantm* an ra-named plasmid from Pichia membranaefaciens (hereinafter 
referred to as "pPM!") and the 2jim plasmid and variants (such as Scpl, Scp2 and 
Scp3) from Saccharomyces cerevisiae (Volkert, et aL 7 1989, Microbiological 
Reviews, 53, 299; Painting, et a!., 1984, J. Applied Bacteriology? 56, 331) and 
other Saccharomyces species, such as S. carlsbergensis . As a family of plasmids 

20 these molecules share a series of common features in that they possess two 
inverted repeats on opposite sides of the plasmid, have a similar size around 6-kbp 
(range 4757 to 66 15-bp), at least three open reading frames, one of which, encodes 
for a site specific recombinase (such as FLP in 2 pm) and an autonomously 
replicating sequence (ARS) 7 also known as an origin of replication (pri), located 

25 close to the end of one of the inverted repeats. (Futcher, 1988, Yeast, 4, 27; 
Murray ef al. t 1988, J. Mol Biol. 200, 601 and Toh-e et al, 1986, Basic Life Sci. 
40, 425). Despite their lack of discernible DNA sequence homology, their shared 
molecular architecture and the conservation of function of the open reading frames 
have demonstrated a common link between the family members. 
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Hie 2\xm plasmid (Figure 1) is a 6,318-bp double-stranded DNA plasmid, 
endogenous in most Saccharomyces cerevisiae strains jat 50-100 copies per 
haploid genome. The 2|im plasmid comprises a small unique (US) region and a 
large unique (XJL) region, separated by two 599-bp inverted repeat sequences. 
5 Site-specific recombination of the inverted repeat sequences results in inter- 
conversion between the A-forrn and B-form of the plasmid in vivo (Volkert & 

1 

Broach, 19S6, Cell, 46, 541). The two forms of 2yaa differ only in the relative 
orientation of their unique regions. 

j 
i 

10 While DNA sequencing of a cloned 2pm plasmid (also known as Sep!) from 
Saccharomyces cerevisiae gave a size of 6,318-bp (Hartley and Doneison, 1980, 
Nature, 286, 860), other slightly smaller variants of 2pm, Scp2 and Scp3» are 
known to exist as a result of small deletions of 125-bp and 220-bp, respectively, in 
a region known as STB (Cameron et al., 1977, Nucl Acids Res., 4, 1429; Kikuchi, 

15 1983, Cell, 35, 4S7 and Livingston & Hahjie, 1979, Proc. Natl Acad. Set USA, 
76, 3727). In one study about 80% of natural Saccharomyces strains from around 
the world contained DNA homologous to 2 jam (by Southern blot analysis) 
(Hollenberg, 1982, Current Topics in Microbiology and Immunobiology* 96, 119)* 
Furthermore, variation (genetic polymorphism) occurs within the natural 

20 population of 2nm plasmids found in S. cerevisiae and 51 carlsbergensis, with the 
NCEI sequence (accession number NC_001398) being one example. 

The 2 jim plasmid has a nuclear localisation and displays a high level of mitotic 
stability (Mead et al, 1986, Molecular General Genetics, 205, 417). The 
25 inherent stability of the 2|um plasmid results from a plasrrrid-encoded copy number 
amplification and partitioning mechanism, which is easily compromised during the 
development of chimeric vectors (Futcher & Cox, 1984, J. BacterioL, 157, 283; 
Bachmair & Ruis, 1984, Monatskefte fllr Chemie, 115, 1229). A yeast strain, 



i 
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which contains a 2^ni plasmid is known as [cir 4 ], while a yeast strain which does 
not contain a 2pm plasmid is known as [cir 0 ] ■ ! 



The US-region contains the KEP2 and FLP genes, and the UL-region contains the 
5 REP1 and JD (also known as RAF) genes, the iSTfi-locus and the origin of 
replication (Broach & Hicks, 1980, Cell, 21, 501; Sutton & Broach, 1985, Mol 
Cell BioL> 5, 2770). The Flp recombinase hinds to FRT-sites (Flp Recognition 
Target) within the inverted repeats to mediate site-specific recombination, which 
is essential for natural plasmid amplification and control of plasmid copy number 
10 . in vivo (Senecoff et at 9 1985 a Proc. Natl Acad. Set U.S.A., 82, 7270; Jayaxam, 
1985, Proc. Natl Acad. Scl U.S.A., 82, 5875). The copy number of 2pm-faniily 
plasmids can be significantly affected by changes in Flp recombinase activity 
(Sleep et al, 2001 ? Yeast, IS, 403; Rose & Broach, 1990, Methods Enzymol, 185, 
234), The Repl and Rep2 proteins mediate plasmid segregation, although their 
15 mode of action is unclear (Sengupta et al 7 2001 ? J. Bacterial, 183, 2306). They 
also repress transcription of the FLP gene (Reynolds et al 9 ,1987, Mol Cell Biol, 7 ? 
3566). 

The FLP and REP2 genes are transcribed from divergent promoters, with 
20 apparently no intervening sequence defined between them. The FLP and REP2 
transcripts both terminate at the same sequence motifs within the inverted repeat 
sequences, at 24-bp and 178-bp respectively after their translation termination 
codons (Sutton & Broach, 1985, Mol Cell Biol, 5, 2770), 



25 In the case of FLP, the C-terminal coding sequence also lies within the inverted 
repeat sequence. Furthermore, the two inverted repeat; sequences are highly 
conserved over 599-bp, a feature considered advantageous to efficient plasmid 
replication and amplification in viva, although only the FRT-sites (less than 65-bp) 
are essential for site-specific recombination in vitro (SenecofF et al 9 1985, Proa 

30 Natl Acad. ScL U.S.A., 82, 7270; Jayaram, 19S5, Proc. Nail Acad. Set U.&A., 82, 

3 j 

i 

i 
1 
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5875; Meyer-Leon et al, 1984, Cold Spring Harbor Syttjfrosia On Quantitative 
Biology, 49, 797)- The key catalytic residues of Flp are axginiue-308 and tyrosine- 
343 (which is essential) with strand-cutting facilitated by histidine-309 and 
histidine 345 (Prasad et al, 1987, Proc. Natl Acad. ScL U.S.A., 84, 2189; Chen et 
5 al, 1992, Cell, 69, 647; Grange et al, 2001, Mol Biol, 314, 717). 

Two functional domains are described in Rep2. Residues 15-58 form a Repl- 
eading domain and residues 59-296 contain a self-association and STB-bindmg 
region (Se?igupta et al, 2001, X Bacterial, 183, 2306). 

id 

Chimeric or large deletion mutant derivatives of 2ixm which lack many of the 
essential functional regions of the 2pim plasmid hut retain functional the cis 
element ARS and STB, cannot effectively partition between mother and daughter 
cells at cell division. Such plasmids can do so if these functions are supplied in 
15 trans, by for instance the provision of a functional 2fim plasmid within the host, a 
so called [cir + ] host. 

Genes of interest have previously been inserted into the I UL-iegion of the 2^un 
plasmid- For example, see plasmid pSAC3Ul in EP 0 286 424. However, there is 
20 likely to be a limit to the amount of DNA that can usefully be inserted into the 
UL-region of the 2jum plasmid without generating excessive asymmetry between 
the US and XJL-regions. Therefore, the US-region ojf the 2|um plasmid is 
particularly attractive for the insertion of additional DNA sequences, as this would 
teud to equalise the length of DNA fragments either side of the inverted repeats. 

25 

This is especially true for expression vectors, such as that shown in Figure 2 ? in 
which the plasmid is already crowded by the introduction of a yeast selectable 
marker and adjacent DNA sequences. For example, the plasmid shown in Figure 
2 includes a p-lactamase gene (for ampicillin resistance), a LEU2 selectable 
30 marker and an oligonucleotide linker, the latter two of which are inserted into a 

4 ;' 
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unique SnaBI-site within the IIL region of the 2pm-family disintegration vector, 
pSAC3 (see BP 0 286 424). The K colt DNA between fheJ^al-sites that contains 
the ampicillin resistance gene is lost from the plasmid shown in Figure 2 after 
transformation into yeast This is described in Chinery & jHinchliffe, 1989, Curr. 
5 Genet, 16, 21 and EP 0 286 424, where these types of jvectors are designated 
"disintegration vectors". In the crowded state shown in Figure 2, it is not xeadity 
apparent where further polynucleotide insertions can be made. A Notl-site within 
the linker has been used for the insertion of additional DNA fragments, but this 
contributes to further asymmetry between the TJL and US regions (Sleep et al 9 
10 1991, Biotechnology (NTQ, 9, 183). 

We had previously attempted to insert additional DNA into the US-region of the 
2 jim plasmid and maintain its high inherent plasmid stability. In the 2|jjn-family 
disintegration plasmid pSAC300 3 a Ll-kb DNA fragment containing the URA3 

15 gene was inserted into itagi-site between REP2 and FLP in US-region in such a 
way that transcription from the URA3 gene was in saine direction as REP2 
transcription (see EP 0 286 424). When S150-2B [cir°] was transformed to uracil 
prototrophy by pSAC30Q, it was shown to be considerably less stable (50% 
plasmid loss m under 30 generations) than comparable vectors with URA3 inserted 

20 into the UL-region of 2{im (0-10% plasmid loss in under 30 generations) (Chinery 
& Hinchliffe, 19S9, Curr. Genet., 16, 21; BP 0 286 424). Thus, insertion at the Eagl 
site may have interfered with FLP expression and it was concluded that the 
insertion position could have a profound effect upon the stability of the resultant 
plasmid a conclusion confirmed by Bijvoet et aL 3 1991, Yeast, 7, 347* 

25 

It is desirable to insert further polynucleotide sequences into 2jim-family 
plasmids. For example, the insertion of polynucleotide sequences that encode host 
derived proteins, recombinant proteins, or non-coding antisense or KNA 
interference (RNAi) transcripts may be desirable. Moreover, it is desirable to 
30 introduce multiple further polynucleotide sequences into 2|im~family plasmids, 
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thereby to provide a plasmid which encodes, for example, multiple separately 
encoded multi-subunit proteins, difiEferemt members of the same metabolic 
pathway P additional selective markers or a recombinant protein (single or multi- 
Bubirait) and a chaperone to aid the expression of the recombinant protein. 

5 

However, the 6,318-bp 2^m plasmid, tod other 2pm-family plasmids, ate 
crowded with functional genetic elements (Sutton & Broach, 1985, MoL Cell 
Biol, 5, 2770; Broach et al, 1979, Cell \6 9 S27), with no obvious positions 
e7dsting for the insertion of additional DNA sequences without a concomitant loss 
10 in plasmid stability. In fact, except for the region between the origin of replication 
and the D gene locus, the entire 2\xm plasmid genome is transcribed into at least 
one poly(A) + species and often more (Sutton & Broach, 1985, MoL Cell BioL 9 5, 
2770). Consequently, most insertions might be expected to have a detrimental 
impact on plasmid function in vivo. 

15 

Indeed, persons skilled in the art have given up on inserting heterologous 
polynucleotide sequences into 2(im-fami!y plasmids, 

Robinson et al, 1994, Bio/Technohgy, 12, 381-384 reported that a recombinant 
20 additional PDI gene copy in Saccharontyces cerevisiae could be used to increase 
the recombinant expression of human platelet derived growth factor (PDGF) B 
homodimer by ten-fold and Schizosacharomyces pornbe acid phosphatase by four- 
fold. Robinson obtained the observed increases in expression of PDGF and & 
pombe acid phosphatase using m additional chromosomally integrated PDI gene 
25 copy. Robinson reported that attempts to use the multi-copy 2pm expression 
vector to increase PDI protein levels had had a detrimental effect on heterologous 
protein secretion. 

Shusta et al s 1998, Nature Biotechnology, 16, 773-777 described the recombinant 
30 expression of single-chain antibody fragments (scFjv) in Saccharomyces 
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cerevisiae. Shusta reported that in yeast systems, the choice between integration 
of a transgene into the host chromosome versus the use of episoornal expression 
vectors can greatly affect secretion and, with reference to Parefch & Wittrup, 1997, 
BiotechnoL Prog., 13, 117-122, that stable integration of the scFv gene into the 

5 host chromosome using a 8 integration vector was superior to the use of a 2]xm- 
based expression plasmid. Parefch & Wittrup, op. cit, ha^i previously taught that 
the expression of bovine pancreatic trypsin inhibitor (BFTl) was increased by an 
order of magnitude using a 6 integration vector rather than a 2Mm-based 
expression plasmid. The 2 Jim-based expression plasmid was said to be comxter- 

10 productive for the production of heterologous secreted protein. 

Bao et at, 2000, Yeast, 16, 329-341, reported that the KIPDI1 gene had been 
introduced into -ST. tactis an a multi-copy plasmid, pKan707, and that the presence 
of the plasmid caused the strain to grow poorly. In the light of the earlier findings 
15 inBaoef ^/,2000,Bao&Fulaihara 3 2001 J Gene,272, 103-110, chose to introduce 
a single duplication of KIPDII on the host chromosome, j 

Accordingly, the art teaches the skilled person to integrate iransgenes into the 
yeast chromosome, rather then into a multicopy vector- There is, therefore, a need 
20 for alternative ways of transforming yeast. 

DESCMPTION OF THE INVENTION 

The present invention relates to recombinantly modified versions of 2jjm-family 
25 plasmids. 

A 2jLim'family plasmid is a circular, double stranded, DNA plasmid. It is typically 
small, such as between 3,000 to 10,000 bp, preferably between 4,500 to 7000 bp, 
excluding recombinantly inserted sequences. Preferred 2iam-family plasmids for 
30 use in the present invention pgjnprise sequences derived from one or more of 

7 ! 
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plasmids pSRl* pSB3 ? or pSE4 as obtained from Zygosaccharomyces rouxii, 
pSBl or pSB2 both as obtained from Zygosaccharomyces bailli, pSMl as obtained 
from Zygosaccharomyces ferrnentati, pKDl as obtained from Kluyveramyces 
drosophilanon, pPMl as obtained from Pichia membranaefaciens and the 2p.ni 
plasmid and variants (such as Scpl 3 Scp2 and Scp3) as obtained from 
Sacckaromyces cerevisiae, for example as described in Volkert et al 9 1989, 
Microbiological Reviews, 53(3), 299-317, Murray ef al, 1988, Mol Biol, 200, 
601-607 and Painting, et aL y 1984, J. Applied Bacteriology, 56, 331. 



10 A 2^m-family plasjnid is capable of stable multicopy maintenance within a yeast 
papulation, although not necessarily all 2 jjim- family plasmids will be capable of 
stable multicopy maintenance within all types of yeast population. For example, 
the 2\im plasmid is capable of stable multicopy maintenance, inter alia, within 
Sacckaromyces czrevisiae and Sacckaromyces carlsbergensis. 

15 

By "multicopy maintenance" we mean that the plasmid is present in multiple 
copies within each yeast cell. A yeast cell comprising 2 (im- family plasmid is 
designated [cir*], whereas a yeast cell that does not comprise 2|xm-fafflily plasmid 
is designated [cir°], A [ci/] yeast cell typically comprises 1 10-100 copies of 2jim- 

20 family plasmid per haploid genome, such as 20-90, more typically 30-80* 
preferably 40-70, more preferably 50-60 copies per haploid genome. Moreover, 
the plasmid copy number can be affected by the genetic background of the host 
which can increase the plasmid copy number of 2pm-Hke plasmid to above 100 
per haploid genome (Gerbaud and Guerineau, 1980, Curr. Genetics, 1, 219* Holm, 

25 1982, Cell, 29, 585, Sleep et ah, 2001, Yeast, IS, 403 and WO99/005O4). 
Multicopy stability is defined below. . 

A 2 Jim-family plasmid typically comprises at least three open reading frames 
("OBJFs") that each encodes a protein that functions in the stable maintenance of 
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the 2pm-family plasmid as a multicopy plasmid. The proteins encoded by the 
three ORFs can be designated FLP 3 REP1 and REP2. Where a 2|im~family 
plasmid comprises not all three of the ORFs encoding FLP, KEPI and REP2 then 
ORFs encoding the missing protein(s) should be supplied in trans 5 either on 
5 another plasmid or by chromosomal integration. 

A "FLP M protein is a protein capable of catalysing the site-specific recombination 
between inverted repeat sequences recognised by FLP. The inverted repeat 
sequences axe termed FLP recombination target (FRT) sites and each is typically 

JO present 35 part of a larger inverted repeat (see below)- Preferred FLP proteins 
comprise the sequence of the FLP proteins encoded by one of plasmids pSRl, 
pSBlj pSB2 ? pSB3„ pSB4 ? pSMl, pKDl 3 pPMl and the 2|im plasmid, for example 
as described in Volfcert et at* op. ciL, Murray et al, op. cii and Painting et al, op. 
ciL Variants and fragments of these FLP proteins are also included in the present 

15 invention. "Fragments" and '"variants" are those which retain the ability of the 
native protein to catalyse the site-specific recombination between the same FRT 
sequences. Such variants and fragments will usually have at least 50%, 60%* 70% > 
80%, 90% 3 95% 3 9S% ? 99% y or more, homology with an FLP protein encoded by 
one of plasmids pSRl, pSBl, pSB2 ? pSB3 ? pSMl a pKDl and the 2|im plasmid 

20 Different FLP proteins can have different FRT sequence Specificities. A typical 
FRT site may comprise a core nucleotide sequence flanked by inverted repeat 
sequences. In the 2fim plasmid, the FRT core sequence is S nucleotides in length 
and the flanking inverted repeat sequences are 13 nucleotides in length (Volkext et 
al 7 op. ciL). However the FRT site recognised by any given FLP protein may b© 

25 different to the 2pm plasmid FRT site* 

RBP1 arid REP2 are proteins involved in the partitioning of plasmid copies during 
cell division, and may also have a role in the regulation of FLP expression. 
Considerable sequence divergence has been observed between REP1 proteins 

30 from different 2pm-family plasmids, whereas no sequence alignment is currently 

9 i 
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possible between REP2 proteins derived from different 2{im-family plasmids, 
Preferred REP1 and REF2 proteins comprise the sequence of the REP1 and REP2 
proteins encoded by one of plasmids pSRl, pSBl, pSB2, pSB3> pSB4, pSMl, 
pKDl, pPMl and the 2j4m plasmid, for example as described in Volkert et al 7 op. 
5 ciL, Murray et al> op. cit and Painting et ah op. cit Variants and ftagmcnts of 
these REP1 and RBF2 proteins are also included in the present invention, 
fragments" and "Variants" of REP1 and REP2 are those which, when encoded by 
the plasmid in place of the native ORF, do not disrupt the stable multicopy 
maintenance of the plasnaid within a suitable yeast population. Such variants and 
10 fragments of REP1 and REP2 will usually have at least 5%l 10%, 20%, 30%, 40%, 
50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or more, homology with a REP1 and 
REP2 protein, respectively, as encoded by one of plasmids pSRl, pSBl, pSB2, 
pSB3, pSB4, pSMl, pKDl, pPMl and Uie 2jjin plasmid. 

15 The REP1 and REP2 proteins encoded by the ORFs on the plasmid must be 
compatible, REP1 and REP2 are compatible if they contribute in combination 
with the other functional elements of the plasmid, towards the stable multicopy 
maintenance of the plasmid which encodes them. Whether or not a REP1 and 
REP2 ORF contributes towards the stable multicopy maintenance of the plasmid 

20 which encodes them can be determined by preparing mutants of the plasmid in 
which each of the EJEP1 and REP2 ORFs are specifically disrupted. If the 
disruption of an ORF impairs the stable multicopy maintenance of the plasmid 
then the ORF can be concluded to contribute towards the stable multicopy 
maintenance of the plasmid in the non-mutated version. It is preferred that the 

25 REP1 and REP2 proteins have the sequences of REP1 and REP2 proteins encoded 
by the same naturally occurring 2^m-family plasmid, such as pSRl, pSBl, pSB2, 
pSB3, pSB4, pSMl, pKDl, pPMl and the 2^m plasmid, or variant or fragments 
thereof. 
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A 2 pin-family plasmid comprises two inverted repeat sequences. The inverted 
repeats may be any size, so long as they each contain an FRT site (see above). 
The inverted repeats are typically highly homologous. They may share greater 
than 50% 60%, 70%, 80%, 90%, 95%, 96%, 97%, 9&% 99%, 99.5% or more 

5 sequence identity. In a preferred embodiment they are identical. Typically the 
inverted repeats are each between 200 to 1000 bp in length. Preferred inverted 
repeat sequences roay each have a length of from 200 to 300 bp, 300 to 400 bp, 
400 to 500 bp 3 500 to 600 bp, 600 to 700 bp, 700 to 800 bp, 800 to 900 bp, or 900 
to 1000 bp. Particularly preferred inverted repeats are those of the plasmids pSRl 

10 (959 bp), pSBl (675 bp), pSB2 (477 bp), pSB3 (391 bp), ; pSMl (352 bp) ft pKDl 
(346 bp), the 2pm plasmid (599 bp), pSB4 and pPMl . 

The sequences of the inverted repeats may be varied. However, the sequences of 
the FRT site in each inverted repeat should be compatible with the specificity of 

15 the FLP protein encoded by the plasmid, thereby to enable the encoded FLP 
protein to act to catalyse the site-specific recombination between the inverted 
repeat sequences of the plasmid. Recombination between inverted repeat 
sequences (and thus the ability of the FLP protein to recognise the FRT sites with 
the plasmid) can be determined by methods known in the art. For example, a 

2Q plasmid in a yeast cell under conditions that favour FLP expression can be assayed 
for changes in the restriction profile of the plasmid which would result from a 
change in the orientation of a region of the plasmid relative to another region of 
the plasmid. The detection of changes in restriction profile indicate that the FLP 
protein is able to recognise the FRT sites in the plasmid and therefore that the FRT 

25 site in each inverted repeat are compatible with the specificity of the FLP protein 
encoded by the plasmid. 



In a particularly preferred embodiment, the sequences of inverted repeats, 
including the FRT sites, are derived from the same 2p.m-family plasmid as the 
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ORF encoding the FLP protein, such as pSRl, pSBl, pSB^, pSB3, pSB4, pSMl, 

pKDl., pPMl or the 2jumi plasmid. i 

i 

i 

The inverted repeats are typically positioned within the Ijim-family plasmid such 
5 that the two regions defined between the inverted repeats (e.g. such as defined as 
UL and US in Hie 2|im plasmid) are of approximately similar size, excluding 
exogenously introduced sequences such as transgenes. For example, one of the 
two regions may have a length equivalent to at least 40%, 50%, 60% 3 70% 9 80%, 
90%, 95% or more, up to 100% ? of the length of the other region. 

10 

A 2|im-family plasmid comprises the ORF that encodes FLP and one inverted 
repeat (arbitrarily termed "IR1 M to distinguish it from the other inverted repeat 
mentioned in the next paragraph) juxtaposed in such a manner that IR1 occurs at 
the distal end of the FLIP ORF, without any intervening coding sequence, for 
15 example as seen in the 2 jam plasmid. By "distal end' 5 in this context we mean the 
end of the FLP ORF opposite to the end from which the promoter initiates its 
transcription. In a preferred embodiment, the distal end of the FLP ORF overlaps 
withlRl. 

20 A 2pm-family plasmid comprises the ORF that encodes REP2 and the other 
inverted repeat (arbitrarily termed CC IR2" to distinguish it from IR1 mentioned in 
the previous paragraph) juxtaposed in such a manner that IR2 occurs at the distal 
end of the REP2 ORF, without any intervening coding sequence, for example as 
seen in the 2^rn plasmid. By "distal end" in this context we mean the end of the 

25 REP2 ORF opposite to the end from which the promoter initiates its transcription. 

In one embodiment, the ORFs encoding REP2 and FLP may he present on the 
same region of the two regions defined between the inverted repeats of the 2jom- 
family plasmid, which region may be the bigger or smaller of the regions (if there 

30 is any inequality in size between the two regions). j 

12 i 

I 
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Irt one embodiment, the ORFs encoding REP2 and FLP may be transcribed from 
divergent promoters- 

5 Typically, the regions defined between the inverted repeats (e.g. such as defined as 
UL and US in the 2 (am plasmid) of a 2jim-family plasinid may comprise not more 
than two endogenous genes that encode a protein that functions in the stable 
maintenance of the 2 jam-family plasmid as a multicopy plasmid. Thus in a 
preferred embodiment, one region of the plasmid defined between the inverted 
10 repeats may comprise not more than the ORFs encoding FLP and REP2; FLP and 
REPl; or REPl and REP2 3 as endogenous coding sequence. 

A 2 jam-family plasmid comprises an origin of replication (also known as an 
autonomously replicating sequence - "ARS"), which is typically bidirectional. 
15 Any appropriate ARS sequence can be present. Consensus sequences typical of 
yeast chromosomal origins of replication may be appropriate (Broach et al, 1982, 
Cold Spring Harbor Symp. Quant. Biol, 47 s 1165-1174; Williamson, Yeast, 1985, 
1, 1-14). Preferred ARSs include those isolated from pSRl, pSBl, pSB2 9 pSB3, 
pSB4, pSMl, pKDl, pPMl and the 2jam plasmid. 

20 

Thus, a 2nm-family plasmid typically comprises at least ORFs encoding FLP and 

REP2, two inverted repeat sequences each inverted repeat comprising an FRT site 

compatible with FLP protein, and an ARS sequence. Preferably the plasmid also 

comprises an ORF encoding REPl, although it may be supplied in trans, as 

25 discussed above. Preferably the FRT sites are derived from the same 2nm-family 

plasmid as the sequence of the encoded FLP protein. Preferably the sequences of 

the encoded REPl and REP2 proteins are derived from the same 2jxm-fexnily 

plasmid as each other. More preferably, the FRT sites are derived from the same 

2Mm-family plasmid as the sequence of the encoded FLP, REPl and REF2 

30 proteins. Even more preferably, the sequences of the ORIfs encoding FLP, REPl 

13 

i 
j 
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and REP2, and the sequence of the inverted repeats (including the FRT sites) are 
derived from the same 2pm-family plasmid. Yet more preferably, the ARS site is 
obtained from the same 2um-famiry plasmid as one or more of the ORFs of FLP, 
REP1 and REF2, and the sequence of the inverted repeats (including the FRT 
5 sites). Preferred plasmids include plasmids pSRl, pSB3jand pSB4 as obtained 
from Zygosacckaromyces rouxii, pSBl or pSB2 both as obtained from 
Zygosacckaromyces bailli, pSMl as obtained from Zygosacckaromyces 
fermentatt, pJCDl as obtained from Klvyveromyc.es drosophilarum, pPMl as 
obtained from Pichia m embranaefaciens, and the 2pm plasmid as obtained from 
10 Saccharomyces cerevisiae, fbt example as described in Volkert et al> 1989, op. 
cit, Muiray et al, op, cit. and Painting et al, op. cit. 

Optionally, a 2 pm-family plasmid may comprise a region equivalent to the STB 
region (also known as REP3) of the 2pm plasmid, as defined in Volkert et al, op. 

15 cit. The SIB region in a 2pm-faijriily plasmid pf the invention may comprise two 
or more tandem Tepeat sequences, such as three, four, five or more. Alternatively, 
no tandem repeat sequences may be present. The tandem repeats may be any size, 
such as 10, 20, 30, 40, 50, 60 70, 80, 90, 100 bp or more" in length. The tandem 
repeats in the STB region of the 2pm plasnrid are 62 bp in length. It is not 

20 essential for the sequences of the tandem repeats to be identical. Slight sequence 
variation can be tolerated. It may be preferable to select an STB region from the 
same plasmid as either or both of the KEPI and REP2 OBlFs. The STB region is 
thought to be a e to-acting element and preferably is not transcribed. 

25 Optionally, a 2pm-family plasmid may comprise an additional OKF that encodes a 
protein that functions in the stable maintenance of the 2um-famUy plasmid as a 
multicopy plasmid. The additional protein can be designated RAF or D. ORFs 
encoding the RAF or D gene can be seen on, for example,, the 2pm plasmid and 
pSMl. Thus a RAF or D ORF can comprise a sequence; suitable to encode the 

14 
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protein product of the RAF or D gene ORFs encoded by the 2|um plasmid or 
pSMl s or variants and fragments thereof. Thus variants! and fragments of the 
protein products of the RAF or D genes of the 2|im plasmid or pSMl axe also 
included in the present invention. 'Tra^eaats" and 'Variants" of the protein 
5 products of the RAF or I> genes of the 2pm plasmid or pSMl are those which, 
when encoded "by the 2(im plasmid or pSMl in place of the native ORF, do not 
disrupt the stable multicopy maintenance of the plasmid within a suitable yeast 
population, Such variants and fragments will usually have at least 5%, 10%, 20%, 
30% 40%, 50% 60% 3 70%, S0%, 90%, 95%, 98%, 99%, or more, homology with 
10 the protein product of the RAF or D gene ORFs encoded by the 2\im plasmid or 
pSMl. 



The present invention provides a 2jim-family plasmid comprising a polynucleotide 
sequence insertion, deletion and/or substitution between the first base after the last 
15 functional codon of at least one of either a REP2 gene or an FLP gene and the last 
base before the FRT site in an inverted repeat adjacent to said gene. 

i 

A polynucleotide sequence insertion is any additional polynucleotide sequence 
inserted into the plasmid. Preferred polynucleotide sequence insertions are 

20 described below. A deletion is removal of one or mone base pairs, such as the 
removal of up to 2, 3, 4, 5, 6, 7, B, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 
300, 400 3 500, 600, 700, 800, 900, 1000 or more base pairs, which may be as a 
single contiguous sequence or from spaced apart regions within a DNA sequence. 
A substitution is the replacement of one or more base pairs, such as the 

25 replacement of up to 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 
200, 300, 400, 500, 600, 700, 800, 900, 1000 or more base pairs, which may be as 
a single contiguous sequence or from spaced apart regions within a DNA 
sequence. It is possible for a region to be modified by any two of insertion, 
deletion or substitution, or even all three. 

30 

15 i 
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The last functional codon of either a REP2 gene or a FLP gene is the codon in the 
open reading frame of the gene that is furthest downstream from the promoter of 
the gene whose replacement by a stop codon will lead to an unacceptable loss of 
multicopy stability of the plasmid., when determined by a test such as defined in 
5 Chinery & ffincHiffe (1989, Curr. Genet, 16, 21-25). For yeast that do not grow 
in the non-selective media (YPD 5 also designated YEPD) defined in Chinery & 
Hinchliffe (19S9, Curr. Genet., 16, 21-25) other appropriate non-selective media 
might be used. Plasmid stability may be defined as the percentage cells remaining 
prototrophic for the selectable marker after a defined number of generations. The 

10 number of generations will preferably be sufficient to show a difference between a 
control plasmid, such as pSAC35 or pSAC310, or to shown comparable stability 
to such a control plasmid. The number of generations may be 1, 2, 3, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30 5 35, 40, 45, 50, 60, 70, 80, 90, 
100 or more. Higher numbers are preferred. The acceptable plasmid stability 
15 might be 1%, 2%, 3%, 4%, 5%, 10%, 15%, 20% ? 25%, 30%, 40%, 50%, 60%, 
70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, ,99.9% cwr substantially 
100%. Higher percentages are preferred. The skilled person will appreciate that, 
even though a plasmid may have a stability less than 100% when grown on non- 
selective media, that plasmid can still be of use when cultured in selective media, 

20 For example plasmid pDB271 1 as described in the examples is only 10% stable 

when the stability is determined accordingly to Example 1, but provides a 15-fold 

* 

increase in recombinant transferrin productivity in shake flask culture under 
selective growth conditions. 

25 Thus, disruption of the REP2 or FLP genes at any point downstream of fee last 
functional codon in either gene, by insertion of a polynucleotide sequence 
insertion, deletion or substitution will not lead to an unacceptable loss of 
multicopy stability of the plasmid. We have surprisingly found that the REP2 
gene of the 2\xm plasmid can be disrupted after codon 59 and that the FLP gene of 
30 the 2fxm plasmid can be disrupted after codon 344, each iwithout leading to an 

16 '! 
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unacceptable loss qf multicopy stability of the plasmid. The last functional codon 
in equivalent genes in other 2pm-family plasmids can be determined routinely by 
modifying the relevant genes and detennining stability as described above. 

5 The REP2 and FLP genes in a 2 \xm plasmid of the invention each have an inverted 
repeat adjacent to them. The inverted repeat can be identified because it matches 
the sequence of another inverted repeat within the same plasmid. By "adjacent" is 
meant that the FLP or REP2 gene and its inverted repeat are juxtaposed in such a 
maimer that the inverted repeat occurs at the distal end of the gene, without any 
10 intervening coding sequence, for example as seen in the 2ynxi plasmid. By "distal 
end" in this eontejst we mean the end of the gene opposite! to the end from which 
the promoter initiates its transcription. In a preferred embodiment, the distal end 
of the gene overlaps with the inverted repeat. 

is In a first preferred aspect of the invention, the polynucleotide sequence insertion, 
deletion and/or substitution occurs between the first base after the last functional 
codon of the REP2 gene and the last base before the FRT sib in an inverted repeat 
adjacent to said gene, preferably between the first base of the inverted repeat and 
the last base before the FRT site, even more preferably at a position after the 
20 translation termination codon of Ihe REP2 gene and before the last base before the 
FRT site. 

In a second preferred aspect of the invention, the polynucleotide sequence 
insertion, deletion and/or substitution occurs between the first base after the last 
25 functional codon of the FLP gene and the last base before the FRT site in an 
inverted repeat adjacent to said gene, preferably between the first base of the 
inverted repeat and the last base before Hie FRT site, more preferably between the 
first base after the end of the FLP coding sequence and the last base before the 
FRT site, such as at the first base after the end of the FLP coding sequence. 
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la one embodiment, other than the polynucleotide sequence insertion, deletion 
and/or substitution, the FLP gene and/or die REP2 gene has the sequence of a FLP 
gene and/or a REP2 gene, respectively, derived from a naturally occurring 2pm- 
family plasmid. 

The tenn ''derived from" includes sequences having an identical sequence to the 
sequence from which they are derived. However* variants; and fragments thereof, 
as defined above, are also included. For example, an FLP gene having a sequence 
derived from the FLP gene of the 2um plasmid may have a modified promoter or 

10 other regulatory sequence compared to that of the naturally occurring gene. 
Alternatively, an FLP gene having a sequence derived firdm the FLP gene of the 
2jjm plasmid may have a modified nucleotide sequence in the open reading frame 
which may encode the same protein as the naturally occurring gene, or may 
encode a modified FLP protein. The same considerations apply to REP2 genes 

is having a sequence derived from a particular source. 

A naturally occurring 2nm-famUy plasmid is any plasmid having the features 
defined above as being essential features for a 2|jm-famiiy plasmid, which 
plasmid is found to naturally exist in yeast, i.e. has not been recombinant!/ 

20 modified to include heterologous sequence. Prefeiably the naturally occurring 
2um-family plasmid is selected from pSRl (Accession No. X02398), pSB3 
(Accession No. X02608) or pSB4 as obtained from Zygosaccharomyces rouxii y 
pSBl or pSB2 (Accession No. NC 002055 or M1S274) boUi as obtained from 
Zygosaccharomyces bailK pSMl (Accession No. NC_002054) as obtained from 

25 Zygosaccharomyces fermentati, pKDl (Accession No. X03961) as obtained from 
Kluyveromyces drosophilarum, pPMl as obtained from Pickia membranaefaciens, 
or, most preferably, the 2fim plasmid (Accession No. NC_001398 or J01347) as 
obtained from Saccharomyces cerevisiae. Accession numbers refer to deposits at 
theNCBI. 



30 
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Preferably, other than the polynucleotide sequence insertion, deletion and/or 
substitution, the sequence of the inverted repeat adjacent to said FLP and/or REP2 
gene is derived from the sequence of the corresponding inverted repeat in the same 
naturally occurring 2 pin-family plasmid as the sequence from which the gene is 
5 derived. Thus, for example, if the FLP gene is derived from the 2pm plasmid as 
obtained from S- cerevisiae* then it is preferred that the inverted repeat adjacent to 
the FLP gene has a sequence derived from the inverted repeat that is adjacent to 
Ihe FLP gene in the 2pm plasmid as obtained from & cepevtsiae. If the REP2 
gene is derived from the 2 jam plasmid as ohtained from SL cerevisiae, then it is 
10 preferred that the inverted repeat adjacent to the REP2 gene has a sequence 
derived from the inverted repeat that is adjacent to ihe REP2 gene in the 2jim 
plasmid as obtained from & cerevisiae. 

Where, in the first preferred aspect of the invention, other than the polynucleotide 
15 sequence insertion, deletion and/or substitution, the REP2 1 gene and the inverted 
repeat sequence have sequences derived from the corresponding regions of the 
2nm plasmid as obtained from S. cerevisiae* then it ;is preferred that the 
polynucleotide sequence insertion, deletion and/or substitution occurs at a position 
between the first base of codon 59 of the REP gene and the last base before the 
20 FRT site in the adjacent inverted repeat, more preferably at k position between the 
first base of the inverted repeat and the last base before the FRT site, even more 
preferably at a position after the translation termination co&on of the REP2 gene 
and before the last base before the FRT site, such as at the first base after the end 
of the REF2 coding sequence. 

Where, other than the polynucleotide sequence insertion, deletion and/or 

substitution, the SJSP2 gene and the inverted repeat sequence have sequences 

derived from the corresponding regions of the 2|J.m plasmid as obtained from S~ 

cereviziae, then in one embodiment, other than the polynucleotide sequence 

30 insertion ? deletion and/or substitution;, the sequence of the REP2 gene and the 

19 
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adjacent inverted repeat is as defined by SEQ ID NO: 1 or variant thereof. In SEQ 
ID NO:l, the first base of codon 59 of the REP2 gene jfe represented by base 
number 175 and the last base before the FRT site is represented by base number 

1216. The FRT sequence given here is the 55-base-pair sequence from Sadowsfci 
5 et al, 1986, pp7-10, Mechanisms of Yeast Recombination (Current 
Communications in Molecular Biology) CSHL. Ed. Klar.'A. Strathem, J. N. In 
SEQ ID NO:!, the first base of the inverted repeat is represented by base number 
887 and the first base after the translation termination codon of the REP2 gene is 
represented by base number 892. 

i 

10 

In an even more preferred embodiment of the first aspect of the invention, other 
than the polynucleotide sequence insertion, deletion and/or substitution, the REP2 
gene and the inverted repeat sequence have sequences derived from the 
corresponding regions of the 2yam plasmid as obtained from S. cerevisiae and, in 

15 the absence of the interruption the polynucleotide sequence insertion, deletion 
and/or substitution, comprise an Xcrnl site or an Fspl site within the inverted 
repeat and the polynucleotide sequence insertion, deletion and/or substitution 
occurs at the XcmX site, or at the Fspl site. In SEQ ID NO:l, tiie Xcml site is 
represented by base numbers 935-949 and the Fspl site is represented by base 

20 numbers 1172-1177. 

i 

"Where, in the second preferred aspect of the invention, other than the 
polynucleotide sequence insertion, deletion and/or substitution, the FLP gene and 
the adjacent inverted repeat sequence have sequences derived from the 
25 corresponding regions of the 2um plasmid as obtained from S. cerevisiae, then it is 
preferred that the polynucleotide sequence insertion, deletion and/or substitution 
occurs at a position between the first base of codon 344 of the FLP gene and the 
last base before the FRT site, more preferably between the first base of the 
inverted repeat and the last base before the FRT site, yet more preferably between 

20 
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the first base after the end of the FLP coding sequence and|fce last base before the 
ERT site, such as at the first base after the end of the FLP coding sequence. 

Where, other than the polynucleotide sequence insertion, deletion and/or 
substitution, the FLP gene and the adjacent inverted repeat sequence have 
sequences derived from the corresponding regions of the 2 urn plasmid as obtained 
from S. cerevisiae, then in one embodiment, other than the polynucleotide 
sequence insertion, deletion and/or substitution, the sequence of the FLP gene and 
the inverted repeat that follows the FLP gene is as defined by SEQ ID NOS or 
variant thereof. In SEQ ID NO:2, the first base of codon 344 of the FLP gene is 
represented by base number 1030 and the last base before the FRT site is 
represented by base number 1419, the first base of the inverted repeat is 
represented by base number 1090, and the first base after the end of the FLP 
coding sequence is represented by base number 1273. 

In an even more preferred embodiment of the second preferred aspect of the 
invention, other than the polynucleotide sequence insertion, deletion and/or 
substitution, the FLP gene and the adjacent inverted repeat sequence have 
sequences derived from the corresponding regions of the 2)xm plasmid as obtained 
from S. cerevisiae and, in the absence of the polynucleotide sequence insertion, 
deletion and/or substitution, comprise an Hgal site or an Fspl site within the 
inverted repeat and the polynucleotide sequence insertion, deletion and/or 
substitution occurs at the cut formed by the action of Hgal on the Hgal site (Hgal 
cuts outside the 5bp sequence that it recognises), or at the Fspl. In SEQ ID NOi2, 
the Hgal site is represented by base numbers 1262-1266 and the Fspl site is 
represented by base numbers 1375-1380, 

The skilled person will appreciate that the features of the plasmid defined by the 
first and second preferred aspects of the present invention are not mutually 
exclusive. Thus, a plasmid according to a third preferred aspect of the present 

21 
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invention may comprise polynucleotide sequence insertions, deletions and/or 
substitutions between the first bases after the last functional codons of both of the 
SEP2 gene and the FLP gene and the last bases before the FRT sites in the 
inverted repeats adjacent to each of said genes, which polynucleotide sequence 
insertions, deletions and/or substitutions can be the same or different. For 
example, a plasmid according to a third aspect of the present invention may, other 
than the polynucleotide sequence insertions, deletions and/or substitutions, 
comprise the sequence of SEQ ID NO. l or variant thereof and the sequence of 
SEQ ID NO:2 or variant thereof, each comprising a polynucleotide sequence 
insertion, deletion ano/or substitution at a position as defined above for the first 
and second preferred aspects of the invention, respectively. 



10 



The skilled person will appreciate that the features of the plasmid defined by the 
first, second and third preferred aspects of the present invention do not exclude the 
is possibility of the plasmid also having other sequence modifications. Thus, for 
example, a 2um-family plasmid of the first, second and thiid preferred aspects of 
the present invention may additionally comprise a polynucleotide sequence 
insertion, deletion and/or substitution which is not at a position as defined above. 
Accordingly, the plasmid may additionaUy carry transgenes-at a site other than the 
20 insertion sites of the invention. 



25 



Alternative insertion sites in 2|im plasmids are known in the art, but do not 
provide the advantages of using the insertion sites defined by the present 
invention. Nevertheless, plasmids which already include a polynucleotide 
sequence insertion, deletion and/or substitution at a site known in the art can be 
further modified by making one or more further modifications at one or more of 
the sites defined by the first, second and third preferred aspects of the present 
invention. The skilled person will appreciate that^ as discussed in the introduction 
to this application, there are considerable technical limitations placed on the 
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insertion of transgenes at sites of 2um-family plasmids other than as defined by 
the first and second aspects of the invention. 

Typical modified 2um plasmids known in the art include those described in Rose 

5 & Broach (1990, Methods Enzymol, 185, 234-279), such as plasmids P CV19, 
P CV20, CV new which utilise an insertion at JJcoRI in FLP, Pounds P CV21, 
pGT41 and pYE which utilise EcoBl in D as the insertion site, plasmid P HKB52 
which utilises PstI m D as the insertion site, plasmid pJDB248 which utilises an 
insertion at PstI in D and EcoBl in D, plasmid pJDB219 in which PstI in D and 

10 EcoXl in FLP are used as insertion sites, plasmid G18, plasmid pABIS which 
utilises an insertion at OdL in FLP, plasmids pGT39 andpA3, plasmids pYTll, 
P YT14 and P YT11-LEU which use Pstl in D as file insertion site, and plasmid 
PTY39 which uses EcdSl in FLP as the insertion site. Other 2um plasmids 
include pSAC3, P SAC3U1, P SAC3U2, pSAC300, P SAC310, pSAC3Cl, 

15 pSAC3PLl, pSAC3SL4, and P SAC3SC1 are described in EP 0 286 424 and 
Chinery & Hmchliffe (1989, Curr. Genet., 16, 21-25) which also described Pstl, 
Eagl or SnaBl as appropriate 2um insertion sites. Further 2»m plasmids include 
P AYE255 S pAYE316, pAYE443, pAYE522 (Kerry-Williams et al 1998, Yeast, 
14, 161-169), pDB2244 (WO 00/44772) and P AYE329 (Sleep et al 2001, Yeast, 

20 18, 403-421). 

hi one preferred embodiment, a 2um-like plasmid as defined by the first, second 
and third preferred aspects of the present invention additionally comprises a 
polynucleotide sequence insertion, deletion and/or substitution which occurs 

25 within an nntranscribed region around the ARS sequence. For example, in the 
2um plasmid obtained ftom S. cerevisiae, the untranscribed region around the 
ARS sequence extends from end of the D gene to the beginning of ARS sequence. 
Insertion into Sna&l (near the origin of replication sequence ARS) is described in 
Chinery & Hmchliffe, 1989, Curr. Genet., 16, 21-25. The skilled person will 

30 appreciate that an additional polynucleotide sequence insertion, deletion and/or 
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substitution can also occur within fhe untranseribed region at neighbouring 
positions to the SnaBI site described by Chinery & HincMirre. 

A plasmid according to any of the first, second or third' aspects of the present 
5 invention may be a plasmid capable of autonomous replication in yeast, such as a 
member of the Saccharomyces, Ktuyveromyces, Zygosaccharomyces, or Pichia 
genus, such Saccharomyces cerevisiae, Saccharomyces carlsbergensis, 
Kluyveromyces lactis, Pichia pastoris and Pichia membranaefaciens, 
Zygosaccharomyces rouxii, Zygosaccharomyces bail if, Zygosaccharomyces 
10 fermentati, or Kluyveromyces drosphilarum. S. cerevisiae and S. carlsbergensis 
are thought to provide a suitable host cell for the autonomous replication of all 
known 2 jam plasmids. 

In a preferred embodiment, the, or at least one, polynucleotide sequence insertion, 
is deletion and/or substitution included in a 2um-family plasmid of the invention is a : 
polynucleotide sequence insertion. Any polynucleotide sequence insertion may be 
used, so long as it is not unacceptably detrimental to the stability of the plasmid, 
by which we mean Ihat the plasmid is at least 1%, 2%, 3%, 4%, 5%, 10%, 15%, 
20%, 25%, 30%, 40% 50%, 60%, 70%, 75%, S0%, 85%, 90%, 95%, 96%, 97%, 
20 98%, 99%, 99,9% or substantially 100% stable on non-selective media such as 
YEPD media compared to the unmodified plasmid, the latter of which is assigned 
a stability of 100%. Preferably, the above mentioned level; of stability is seen after 
separately culturing yeast cells comprising the modified and unmodified plasmids 
in a culture medium for one, two, three, four, five, six, seven, eight, nine ten, 11, 
25 12, 13, 14, 15, 16, 17, 18, 19 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 or more 
generations. 

Where the plasmid comprises a selectable marker, higher levels of stability can be 
obtained when transformants are grown under selective conditions (e.g. in minimal 
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medium), since the medium can place a selective pressure on the host to retain the 
plasmid* 1 

Stability in non-selective and selective (e.g. minimal) media can be determined 
5 using the methods set forth above. Stability in selective media can be 
demonstrated by the observation that the plasmids can be used to transform yeast 
to prototrophy. 

Typically, the polynucleotide sequence insertion will be at least 4, 6 5 8, 10, 20, 30, 
10 40, 50, 60, 70 3 SO, 90, 100, 150, 200, 250, 300, 350, 400, 450, 500 or more base 
pairs in length, Usually, the polynucleotide sequence insertion will be up to lkb, 
2kb, 3kb, 4kb, 5kb, 6kb, 7kb 3 Skb, 9kb, lOfcb or more in length. The skilled 
person will appreciate that the 2|im plasmid of the present invention may comprise 
multiple polynucleotide sequence insertions at different sites within the plasmid. 
15 Typically, the total length of polynucleotide sequence insertions is no more than 
5kb, lOkb, 15kb, 20kb 5 25kb or 30kb although greater total length insertion may 
be possible. 

The polynucleotide sequence insertion may contain a transcribed region or may 
20 contain no transcribed region. A transcribed region may encode an open reading 
frmne, or may be non-coding. The polynucleotide sequence insertion may contain 
both transcribed and non-transcribed regions* 

j 

A transcribed region is a region of DNA that can be transcribed by RNA 
25 polymerase, typically yeast RNA polymerase. A transcribed region can encode a 
functional RNA molecule, such as ribosomal or transfer RNA or an RNA 
molecule that can function as an antisense or RNA linterference ("RNAi") 
molecule. Alternatively a transcribed region can encode a messenger RNA 
molecule (mRNA), which xnJBNA can contain an open reading frame (OKF) which 
30 can be translated in vivo to produce a protein. The term "protein" as used herein 

25 i 
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includes all natural and non-natural proteins, polypeptides and peptides. 
Preferably, the ORF encodes a heterologous protein. By ^'heterologous protein" 
we mean a protein that is not naturally encoded by a 2f.un-family plasmid. 
Preferably, therefore, the heterologous protein is not a FLP, REP1, REP2, or a 
5 RAF/D protein as encoded by any one of pSRl , pSB3 or pSB4 as obtained from Z. 
rouxii, pSBl or pSB2 both as obtained from Z. bailli, pSMl as obtained from Z 
fcrmentati, pKDl as obtained from K. drasophilarum, pPMl as obtained from P. 
membranaefaciens and the 2\xm plasmid as obtained from £ cerevisiae. 

io Where the polynucleotide sequence insertion encodes an open reading frame, then 
it may additionally comprise some polynucleotide sequence that does not encode 
an open reading frame (termed "nonnsoding region"'). 

Non-coding region in Hie polynucleotide sequence insertion may contain one oi- 
ls more regulatory sequences, operatively linked to the open reading frame, which 
allow for the transcription of the open reading frame and/or translation of the 
resultant transcript 

The term "regulatory sequence" refers to a sequence that modulates (i.e. 5 promotes 
20 or reduces) the expression (i.e., the transcription and/or translation) of an open 
reading frame to which it is operably linked. Regulatory regions typically include 
promoters, terminators, liposome binding sites and the like. The skilled person 
will appreciate that the choice of regulatory region will depend upon the intended 
expression system. For example, promoters may be constitutive or inducible and 
25 may be cell- or tissue-type specific or non-specific. 

Where the expression system is yeast, such as Saccharomyces cerevisiae, suitable 
promoters for S. cerevisiae include those associated with the PGKI gene, GAL1 or 
GALIO genes, TEF1, TEF2 7 PYKU PMA1, CYC1, PH05, TRP1, ADHh ADH2, 
30 the genes for glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate 
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decarboxylase, phosphofructoktnase ? triose phosphate isouierase, phosphoglucose 
Isornerase^ glucokinase, a-mating factor pheromone* a-mating factor pheromonftj 
the PRB1 promoter, the PRAI promoter, the GPD1 promoter, and hybrid 
promotes involving hybrids of parts of 5' regulatory regions with parts of 5' 
5 regulatory regions of other promoters or with upstream activation sites (e.g. the 
promoter of EP-A-258 067). 

Suitable transcription termination signals are well known in the art. Where the 
host cell is eukaryotic, the transcription termination signal is preferably derived 

io from the 3' flanking sequence of a eukaryotic gene, which contains proper signals 
for transcription termination and polyadcnylation. Suitable 3' flanking sequences 
rmy, for example, be those of the gene naturally linked to the expression control 
sequence used, Le. may correspond to the promoter. Alternatively, they may be 
different In that case, and where the host is a yeast, preferably S. cerevisiae^ then 

15 the termination signal of the SL cerevisiae ADH1 9 ADHlJCYCl, or PGK1 genes 
are preferred. 

In one embodiment, the favoured regulatory sequences in yeast, such as 
Saccharomyces cerevisiae, include: a yeast promoter (e.g. the Saccharomyces 
20 cerevisiae PRB1 promoter), as taught in EP 431 880; and a transcription 
terminator, preferably the terminator from Saccharomyces ADH1, as taught in EF 
60 057. 

It may be beneficial for the non-coding region to incorporate more than one DNA 
25 sequence encoding a translational stop codon, such as UAA, UAG or UGA, in 
order to minimise translational read-through and thus avoid the production of 
elongated, non-natural fusion proteins. The translation stop codon UAA is 
preferred. Preferably, at least two translation stop codons are incorporated. 
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The term "operably linked" includes within its meaning that a regulatory sequence 
is positioned within any non^coding region such that it forms a relationship with 
an open reading frame that permits the regulatory region to exert an effect on the 
open reading frame in its intended manner. Thus a regulatory region "operably 
linked" to an open reading frame is positioned in such a way that the regulatory 
region is able to influence transcription and/or translation of the open reading 
frame in the intended manner, under conditions compatible with the regulatory 
sequence. 

Where the polynucleotide sequence insertion as defined by the first, second or 
third aspects of the present invention includes an open reading frame that encodes 
a protein, then it may be advantageous for the encoded protein to be secreted. In 
that case, a sequence encoding a secretion leader sequence which, for example, 
comprises most of the natural HSA secretion leader, plus a small portion of the S. 
cerevisiae ct-mating factor secretion leader, as taught in WO 90/01063, may be 
included in the open reading frame. 

Alternatively, the encoded protein maybe intracellular. 

In one preferred embodiment, at least one polynucleotide sequence insertion as 
defined by the first, second or third aspects of the present invention includes an 
open reading frame comprising a sequence that encodes a yeast protein. In 
another preferred embodiment, at least one polynucleotide sequence insertion as 
defined by the first, second or third aspects of the present invention includes an 
open reading frame comprising a sequence that encodes a yeast protein from the 
same host from which the 2pm-Hfce plasmid is derived. 

In another preferred embodiment, at least one polynucleotide sequence insertion as 
defined by the first, second or third aspects of the present invention includes an 
open reading frame comprising a sequence that encodes a protein involved in 
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protein folding, or which has ebaperone activity or is involved in the unfolded 
protein response (Stanford Genome Database (SGD), http:://db.yeastgenome.org). 
Preferred proteins include AHA1, CCT2, CCTS, CCT4,'CCT5, CCT6, CCT7, 
CCT8, CNS1, CPR3, CPR6, EROl, EUG1, FMOl, HCH1, HSP10, HSP12, 

5 HSP104, BSP26, HSP30, HSP42, HSP60, HSP78, HSP82, JEMl, MDJ1, MDJ2, 
MPD1, MPD2, PDI1, PFD1, ABC1, APJ1, ATP1I, ATPI2, BTT1, CDC37, CNSI, 
CPR6, CPR7, HSC82, KAR2, LHS1, MGE1, MRS1I, NOBt, ECM10, SSA1, SSA2, 
SSA3,'sSA4, SSC2, SSE2, SILJ, SLS1, ORMU UBI4, ORM2, PERU PTC2, PSEI 
and HAC1 or a truncated fcitronless HACl (Vaflconen'ef at. 2003, Applied 

10 Environ. Micro. 69, 2065). 

A preferred chaperone is protein disnlphide isomerase (SDI) or a fragment or 
variant thereof having an equivalent ability to catalyse the fprmation of disnlphide 
bonds within ihe lumen of the endoplasmic reticulum (ER).' By <TDI» we inelude 
15 'any protein having the ability to reactivate the ribonuclease activity against KNA 
of scrambled ribonudease as described in EP 0 746 61 1 and Hillson et al, 1984, 
Methods Enzymol. 107, 281-292. 

Protein disnlphide isomerase is an enzyme which! typically catalyzes 
20 thiolrdisulphide interchange reactions, and is a major resident protein component 
of the EJL lumen in secretory cells. A body of evidence suggests that it plays a 
role in secretory protein biosynthesis (Freedman, 1984, Trends Biockem. Set, 9, 
438-41) and this is supported by direct cross-linking studies in situ (Roth and 
Pierce, 1987, Biochemistry. 26, 4179-82). The finding mat microsomal 
25 membranes deficient in PDI show a specific defect in cotranslational protein 
disnlphide formation (Bulleid and Freedman, 1988, Nature^ 335, 649-51) implies 
that the enzyme fractions as a catalyst of native disulpHdebond formation during 
the biosynthesis of secretory and cell surface proteins. This role is consistent with 
what is known of the enzyme's catalytic properties in vitro' it catalyzes thiol: 
30 disnlphide interchange reactions leading to net protein 'disulphide formation, 
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breakage or isomerization, and can typically catalyze mWein folding and the 
formation of native disulphide bonds in a wide variety, of reduced, unfolded 
protein substrates Redman etal., 1989, Biochem. Soc. Symp., 55, 167-192). The 
DNA and amino acid sequence of the enzyme is known for Seve ral species 
3 (Scherens m ah 1991, no*. 7, 185-193; Farqubar * al, 1991, ^ i 08j 81-89- 
EP07466I; EP0293793; EP0509841) and there is inching information on the' 
mechanism of action of the enzyme purified to homogeneity from mammalian 
liver (Oeighton et al, 1980, J. Mol. Biol., 142, 43-62; Freedman et al 19SS 
Biochem. Soc. Trans., 16, 96-9; Gilbert, 1989, Biochemistry, 28 7298-7305" 
Unidstrom and Holmgren, 1990, J. Biol. Ch em _, 265, 91 1^9120; Hawkins and 
Freedman, 1990, Biochem. J„ 275? 3 35-339). Of the many protein factor, 
cnn^tly implicated as mediators of protein folding, assembly and translocation in ' 
*e cell (Rothman, 1989, Cell, 59, 591-601), PDI has a Well-defined camlytic 
activity. 

v* 

PDI is readily isolated from mammalian tissues and the homLgeneous enzyme is a 
homodmaer (2x57 kD) with characteristically acidic pi (4.0-4.5) (Hfflso* et al 
1984, Methods Enzymol, 107, 281-292). Tne has ^ been from 

wheat and from the alga Cklamydomonas reinhardii (Kaska et al, 1990, Biochem 
J., 268, 63-68), rat (Edman et al, 1985, Nature, 317, 267-270), bovine (Yamanchi 

* * ***** 146, ,'1485-1492), human 

(Pndajamemi et al, 1987, 6 , 643-9), yeast (Scherens fl /, 

Farquhar * ^ ^ and chick (Parkkonen * a/, I98S, S«, C W 256 1005- 
1011). The proteins from these vertebrate species show 8 high degree of sequence 

c„tion throughout and ail show several overall features first noted in the rat 

PDI sequence (Edman et al., 1985, op. cit.). 



A ye ast protein disnlphide isomers parser, PDI1, can be found as Genbank 
accession no. CAA42373 or BAA00723. ft has the fbhowing sequence of 522 
30 ammo acids: 
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1 m^aagavla faqqaavap* dsawklatd Sf ^i, B bd lv la e«apw 

61 egtape yvkaaetlva knitlaqidc tenqdlcmeh nipgfpslkx fkusdv^xsx 
X21 dyegprtaea i-ttfndkopq pavawadlp ayla»e«vt pvxvqsgkxd .dfnatfy,™ 
«l a nkhf D dydf vaaanadddf klsiylpsam depwyngfck adiadadvfe kwlqvealpy 
24 x fgaidgsvfa qyvesglplg ylfyndeeel eeykplftel akk^gl^f vsidarkfgr 
box hagalnttksq fplSaihdmc adlkyglpql seeafdelsd kwleskaxe slvkdtlkgd 
361 aapxvksqei fe^daa^q Ivgkrihdeiv ndpkkdvlvl yyapwcghck rlaptyqela 
421 dtyauatsdv liakldHea* dvrgwiegy pt:ivlypggk kseawyqgs rsldslfdfx 
481 kenghfdvdg kalyesaqek aaeeadadae ladeedaxhd el 

An alternative PDI sequence can be found as Genbank accession no. CAA38402. 
It has fiie following sequence of 530 amino acids 

a attfsagavle W .lllu«r fagqaavapa daawklatd afneyiqshd Ivlaaffapw 
61 ashcknroape y.kaaefclv* k^tlaqide tenqdlcxnan nlKKp-Ud. ffcnrdvnnai 
i 3 i dyaHpxta.a ivqfttikqsq pavawadlp ayla^fvn pvivqagkid adfnatfysm 
Ml afikhffidydf atMMdUfi klsiylpsa™ dapwyagkk adiadadvfia kwlqvaalpy 
24i Egeidgsvfia qyveaglplg ylfjmdeeel eeykplftel akknrglimf vsidarfrfgr 
301 hagnlx^nkeq fplfaihdmt adlkyglpql seeafdelsd kivla B kai e alvkdflkgd 
3C1 a B pivksqai fenqdssvfq IvgknUdexv ndpkkd^lvl yyapwcghok rlaptyqela 
421 dtyanafcadv liakldhten dvrgwxegy ptx^lypggk kseswyqgs rsldslfdfx 
481 kenglifdvdg kalyeaaqek aaeeaeadae aaadadaela daadaiUdel 

Variants and fragments of die above PDI sequences, and variants of other naturally 
occurring PDI sequences are also included in the present invention. A "variant", in 
the context of PDI, refers to a protein wherein at one or more positions ihere have 
been amino acid insertions, deletions, or substitutions, either conservative or non- 
conservative, provided that such changes result in a protein whose basic properties, 
for example enzymatic activity (type of and specific activity), fhermostabiliry, 
activity in a certain pH-range (pH-stability) have not significantly been changed. 
"Significantly" in mis context means that one skilled in the art would say mat the 
properties of the variant may still be different but would not be unobvious over the 
ones of the original protein. 
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By "conservative substitutions" is teukd combinations such as Val, He Leu Ala, 
Me, Asp, Glu; Asn, Gin; Ser, Thr ? Gly, Ala; Lys, Arg, His; and ^T^Tzp. 
Weired conservative substations include Gly, Ala; Val, lie, Leu; Asp, Glu; Asu 
s Gln;Ser } Thr;Lys,Arg; andPhe,Tyr. 

A "variant" typically has at least 25%, at least 50%, at least 60% or at least 70% 
»ly at least 80%, moxe preferably at least 90%, even more preferably at least' 
95/o, yet more preferably at least 99%, most prefembly at,' least 99.5% Sequeilce 
io identity to the polypeptide firan which it is derived. j 

The p^t sequence identity between two polypeptides maybe determined using 
stable computer programs, as discussed below. Such variants may be natural or 
-adeusiug the methods of protein engineering and site-directed mutagenesis as are 
well known in the art. 
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SaSm ^' fa tte C0m4Xt ° f HH to . protdn who™ at one ox ™ 
posmonafhere have teen deletion*. TTn 1 au K fh wen t m ay<^ II ^at m o a 5 „ 
20, 30, 40 or 50% of «h e coo^Iete sequence of the fell . .natexe TO ^ 
TSP^ally a fragment comprise, up to 60%, more ^ 70%> preferab|y 

to 80%, mom preferably up to 90%, even « prefer ^ to ^ yet ^ 
Preferably up te 9 o% of fte compIete ofthe ftn m ^ 

P-feted flagmen* of PDI protein course one or more whole domain, of fte 
desired protein. 

E is particularly preferred that a plasmid aeeordiug to a feat, second or ftW 
aspects of a. invention taeh.de,, eifcer within a polynucleotide sequence 
msemou, or elsewhere on the pjasnud, an open ^din* ftamk encoding a protein 
contpnaing ft. sequence of aftumta or a Augment or variant thereof 
Alternatively, the host cell into which the plasmid is tansformed may include' 

i 

32 



|0P89820 : l^23;yD:ec j03;:aT2 - 49?| 



23/12 2003 12:30 FAX 01159522201 



ERIC POTTER CLARKSON HI0»8 



within its genome a polynucleotide sequence encoding a protein comprising the 
sequence of albumin or a fragment or variant thereof, either as an endogenous or 
heterologous sequence. 

5 By "albumin" we include a protein having the sequence of an albumin protein 
obtained firom any source. Typically the source is mammalian. In one preferred 
embodiment the serum albumin is human serum albumin ( <e HSA">- The term 
"human serum albumin" includes the meaning of a serum albumin having an 
amino acid sequence naturally occurring in humans, and variants thereof. 

10 Preferably the albumin has the amino acid sequence disclosed m WO 90/13653 or 
a variant thereof The HSA coding sequence is obtainable by known methods for 
isolating cDHA corresponding to human genes, and is also disclosed in, for 
example, EP 73 646 and EP 286 424. 

is In another preferred embodiment the "albumin" has the sequence of bovine scrum 
albumin. The term "bovine serum albumin" includes the meaning of a serum 
albumin having an amino acid sequence naturally occurring in cows, for example 
as taken from Swissprot accession number P02769, and variants thereof as defined 
below. The term "bovine serum albumin" also includes the meaning of fragments 

20 of full-length bovine serum albumin or variants thereof; as defined below. 

In another preferred embodiment the albumin is an albumin derived from (i.e. has 
ihe sequence of) one of serum albumin from dog (e.g. see Swissprot accession 
number P49822), pig (e.g. see Swissprot accession number P0SS35), goat (e.g. as 

25 available from Sigma as product no. A2514 or A4l 64), turkey (e.g. see Swissprot 
accession number 073860), baboon (e.g. as available from Sigma as product no. 
A1516), cat (e.g. see Swissprot accession number P49064), chicken (e.g. see 
Swissprot accession number P19121), ovalbumin (e.g. chicken ovalbumin) (e.g. 
see Swissprot accession number P01012), donkey (e.g. see Swissprot accession 

30 number P3909Q), guinea pig (e.g. as available from Sigma as product no. A3060, 
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A2639, 05483 or A6539), hamster (e.g. a s available from Sigma as product no . 
A5409), horse (e.g. see Swi sspr ot accession number P35747), rhesus monkey (e.g, 
see Swissprot accession number Q2S522), mouse (e.g. se ' e Swissprot accession 
number O89020), pigeon (e.g. as defmed by Khan et al, 2002, Int. J. Biol 
Macromol, 30(3-4), 17 1-8), rabbit (e.g. see Swissprot accession number P49065), 
rat (e.g. see Swissprot accession number P36953) and sheep (e.g. see Swissprot 
accession number P14639) and includes variants and fragment, thereof as defined 



below, 



io Many naturally occurring mutant forms of albumin are' known. Many are 
described in Peters, (1996, All About Albumin: Biochemistry, Genetics and Medical 
Applications, Academie Press, Inc., San Diego, California, p.170-181). A variant as 
defined above may be one of these naturally occurring mutants. 

is A 'Variant albumin" refers to an albumin protein herein at one or more positions 
there have been amino acid insertions, deletions, or substitutions, either conservative 
or non-conservative, provided that such changes result in an albumin protein for 
wmch at least one basic property, for example binding activity (type of and specific 
acuvuy e .g. binding to bilirubin), osmolarity (oncotic pressure, colloid osmotic 
* pressure), behaviour in a certain pH-range (pH-stability) has ,not significantly been 
changed. "Significantly" in this context means that one .killed in the art would say 
to the properties of the variant may still be different but would not be unobvious 
over the ones of the original protein. 



25 



By "conservative substitutions" is intended combinations such as Gly Ala- Val lie 
Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. Such variants may be 
-*de by techniques well *nown in the art, such as by site-diWd mutagenesis as 
^closed in US Patent No 4,302,386 issued 24 November 1981 to Stevens 
incorporated herein by reference. j 
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Typically an albumin variant will have more than 40%, usually at least 50%, more 
typically at least 60% ? preferably at least 70%, more prefekbly at least 80%, yet 
more preferably at least 90% 5 even more preferably at least ^5% 5 most preferably at 
least 9S% or more sequence identity with naturally occurring albuirrin. Tine percent 
5 sequence identity between two polypeptides may be determined using suitable 
computer programs, for example the GAP program of the University of Wisconsin 
Genetic Computing Group and it will be appreciated thiat percent identity is 
calculated in relation to polypeptides whose sequence has been aligned optimally. 
The alignment may alternatively be carried out using the Clustal W program 
10 (Thompson et aL 7 1994). The parameters used may be as follows: 



Fast pairwise alignment parameters: K-tuple(woid) size; 1, window size; 5, gap 
penalty; 3 ? number of top diagonals; 5. Scoring method: x percent Multiple 
alignment parameters: gap open penalty; 10, gap extension penalty; 0-05. Scoring 
15 matrix: BLOSUM. 

i 

The term "fragment" as used above includes any fragment of full-length albumin 
or a variant thereof, so long as at least one basic property, for example binding 
activity (type of and specific activity e.g. binding to bilirubin), osmolality (oncotic 

20 pressure, colloid osmotic pressure), behaviour in a certain pH-xange (pH-stability) 
h^s not significantly been changed. "Significantly* 3 in this context means that one 
skilled in the art would say that the properties of the variant may still be different but 
would not be unobvaous over the ones of the original protein. A fragment will 
typically be at least 50 amino acids long. A fragment may comprise at least one 

25 whole sub-domain of albumin. Domains of HSA have been expressed as 
recombinant pioteins (Dockal, M. et aL, 1999, J. Biol Chem. ? 274, 29303-29310), 
where domain I was defined as consisting of amino acids 1-197, domain II was 
defined as consisting of amino acids 189-385 and domain, III was defined as 
consisting of amino acids 381-585. Partial overlap of the domains occurs because 

30 of tlie extended a-helix structure (hlO-hl) which exists between domains I and II, 

35 

i 

! 
i 
i 



23/12 2003 



12:31 FAX 01159522201 



ERIC POTTER CLARKSQN 



II 041 

• 



30 



and between domains H and m (Peters, 1996, op. cit, Table 2-4). HSA also 
comprises six sub-domains <sub-dom ains IA, IB, HA, IIB,. IHA and Hffi). Sub . 
doman. IA comprises amino acids 6-105, sub-domain IB comprises amino acid s 
120-177, snb-domain IIA comprises amino acids 200-291, snb-domain IIB 
5 comprises amino acids 316-369, sub-domain HIA comprises amino acid, 392-491 
and sub-domain 1MB comprises amino acids 512-5S3. A fragment may comprise a 
whole or part of one or more domains or sub-domain* as defined above, or any 
combmation of those domains anoVor sub-domains. 

io Thus the polynucleotide insertion may comprise an open reading frame mat 
encodes albumin or a variant or fragment thereof. 

The skilled person will also appreciate that the open reading frame of any olher 
gene or variant, or part or either, can be utilised to form a whole or part of an open 
15 reading frame in forming a polynucleotide sequence insertion for use with the 
present invention. For sample, the open reading frame may encode a protein 
compnsmg any sequence, be it a natural protein (including a zymogen), or a 
vanant, or a fragment (which may, for example, be a domain) of a uatural protein; 
or a totally synthetic protein; or a single or multiple fusio* of different proteins 
> (natmal or synthetic). Such proteins can be taken, but not exclusively, from the 
hsts provided in WO 01/79258, WO 01/79271, WO 01/79442= WO 01/79443 WO 
01/79444 and WO 01/79480, or a variant or fragment thereof; the disclosures of 
wbrch are incorporated herein by reference. Although these patent applications 
present the list of proteins in the context effusion partners for albumin, the present 
mvennon is not so limited and, for the purposes of the present invention, any of 
the proteins listed therein may be presented alone or as.&sion partners for 
albumm, the Fc region of immunoglobulin, transferrin, lact^ or any other 
protein or fragment or variant of any of the above, including fusion proteins 
compnsmg any of the above, as a desired polypeptide. Examples of transferrin 
fusions are given in US patent applications US2003/0221201 and 
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US2003/0226155, Shin, et aL, 1995, Proc Nail Acad Sci US A 92, 2820; AH, et 
at, 1999, J Biol Chem, 274, 24066; Mason, et aL 7 2002, Bidchemistty, 41, 9448. 

Preferred other examples of desirable proteins for expression by the present 

5 invention includes sequences comprising the sequence of a monoclonal antibody^ 
an etoposide, a serum protein (such as a blood clotting faictor), antistasin, a tick 
anticoagulant peptide, transferrin, lactofenin, endostatin, angiostatm, collagens, 
immunoglobulins, Fab' ftagments, F(ab')2> scAb, scFv, interferons, mterleukins, 
3L1Q, IL11 3 IL2, interferon a species and sub-species, interferon p species and 

10 sub-species, interferon y species and sub-species, leptin, C1STIF, CNTF Ax i5 a XiLl- 
receptor antagonist, erythropoetin (EPO) and EPO mimics^ thiombopoetin (TPO) 
and TPO mimics, prosaptide, cyanovirin-N, 54ielix, T20 peptide, T1249 peptide, 
HTV gp41 ? HIV gpl20, urokinase, prourokiaase ? tPA (tissue plasminogen 
activator), hirudin, platelet derived growth factor, parathyroid hormone, 

15 ' proinsulin, insulin, glucagon, glucagon-like peptides, insulin-like growth fkctor, 
calcitonin, growth hormone, transforming growth factor (5, tumour necrosis factor, 
G-CSF, GM-CSF, M-CSF, FGF, coagulation factors in both pre and active forms, 
including but not limited to plasminogen, fibrinogen, thrombin, pre-thrombin, pro- 
thrombin, von Willebrand's factor, ai-antitrypsin, plasminogen activators, Factor 

20 VII, Factor VIII^ Factor IX, Factor X and Factor XIII, nerve growth factor, LACI 
(lipoprotein associated coagulation inhibitor, also known as tissue factor pathway 
inhibitor or extrinsic pathway inhibitor), platelet-derived endothelial cell growth 
factor (PD-ECGF), glucose oxidase, serum cholinesterase 3 aprotinin, amyloid 
precursor, inter-alpha trypsin inhibitor, antithrombin III, apo-Hpoprotedn species, 

25 Protein C, Protein S, a variant or fragment or fusion protein of any of the above. 

A 'Variant" in the context of the above-listed proteins, refers to a protein wherein at 
one or more positions there have been amino acid insertions, deletions, or 
substitutions, either conservative or non-conservative, provided that such changes 
30 result in a protein whose basic properties, for example enzymatic activity or receptor 
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binding (type of and specific activity), fcennostabflity, activity in a certain pH-range 
(pH-stabuhy) have not signitlcantiy been changed. "Sigmficantly" in this context 
means that one .killed in the ait would wfl ^t the properties bf the variant may still 
be Afferent but would not be unobvious over the ones of the original protein 



I 



By Conservative substitutions" is intended combinations sucli as Val, lie Leu Ala, 

Met Asp, Gin; Asn, Gin; Ser, Tbr, Gly, Ala; Lys, Arg, His; and Phe^T^Trp 

Preferred conservative substitutions include Gly, Ala; Val, He, Leu; Asp, GI«; Asn, 

Gin; Ser, Thr; Lys, A*g; and Phe, Tyr. 
10 j 

A "variant" typically has at least 25%, at least 50%, at least '60% or at least 70% 
preferably at least 80%, more preferably at least 90%, even more preferably at ]east ' 
95%, yet more preferably at least 99%, most preferably at jeast 99.5% sequent 
identity to the polypeptide from which it is derived. 

15 

The percent sequence identity between two polypeptide, may be determined nring 
stable eornpmer programs, fox example the GAP program! of ^varsity of 
Wrseonshr Genetie Cornpnttag Qronp and i, will be abated that percent 
rdenoty >s oatarlated in relation to polypeptides whose ae.per.ee has been aligned 
20 optimally. 

The aligns, ^ atemeliv6Ly „„ esnicd ^ usi „ g ^ ! ansta] w 

(Thompson « si, (1994) iWrfe Acids He,, 12Q2), 4673-SO). The parameter, 
fcsed may be as follows: 

* . ^P^Walignmen^^ 

penalty; 3, number of top diagonals; 5. Scoring method; x percent. 

Multiple alignment parameter,: gap open penalty; 10, gap extension penalty 



0.05. 

• Scoring matrix: BLOSUM. 



30 



I 
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Such variants may be natural or made using the methods of protein engineering and 
site-directed mutagenesis as are well known in the art. 

A "fragment", in the context of the abov^listed proteins, ref&s to a protein wherein 
5 at one or more positions there have been deletions. Thus the fragment may comprise 
at most 5, 10, 20, 30, 40 or 50% of the complete sequence of the full mature 
polypeptide. Typically a fragment comprises up to 60%, moie typically up to 70%, 
preferably up to 80% more preferably up to 90%, even moid preferably up to 95%, 
yet more preferably up to 99% of the complete sequence of the full desired protein. 
io Particularly preferred fragments of a desired protein comprise one or more whole 
domains of the desired protein- 



It is particularly preferred that a plasmid according to a first, second or third 
aspects of the invention includes, either within a polynucleotide sequence 

15 insertion, or elsewhere on the plasmid., an open reading frame encoding a protein 
comprising the sequence of albumin or a fragment or variant thereof, or any other 
protein take from the examples above (fused or unfused to a fusion partner) and at 
least one other heterologous sequence, wherein the at least one other heterologous 
sequence may contain a transcribed region^ such as an open reading frame. In one 

20 embodiments the open reading frame may encode a projtein comprising the 
sequence of a yeast protein. In another embodiment the opeh reading frame may 
encode a protein comprising the sequence of a protein involved in protein folding, 
or which has chaperone activity or is involved in the unfolded protein response, 
preferably protein disulphide isomerase. 

25 

The present invention also provides a method of preparing a plasmid of the 
invention, which method comprises - 

(a) providing a 2pm-family plasmid comprising a REP2 gene or an FLP gene 
30 and an inverted repeat adjacent to said gene; 

39 
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15 



providing a polynucleotide sequence and inserting the polynucleotide 
sequence into the plasmid at a position according to the first, second or 
third preferred aspects of the invention; and/or 

additionally or as an alternative to step (b), deleting some or all of the 
nucleotide oases at tiie positions according to the first, second or third 
preferred aspects of the invention; and/or 



10 (d) additionally or as an alternative to either of steps (b) and (c), substituting 
some or all of the nucleotide bases at the positions according to the first, 
second or third preferred aspects of the invention with alternative' 
nucleotide bases. 



20 



Steps (b), (c) and (d> can be achieved using techniques well known in the art, 
including cloning techniques, site-directed mutagenesis and-the like, such as are 
described in by Sambrook et *L, Molecular Cloning: A Laboratory Manual, 2001, 
3rd edition, the contents of which are incorporated herein by reference. For 
example, one such method involves ligation via cohesive ends. Compatible 
cohesive ends can be generated on a DN A fragment for insertion and plasmid by the 
action of suitable restriction enzymes. These ends will rapidly anneal through 
complementary base pairing and remaining nicks can be closed by the action of 
DNA ligase. 



25 



30 



A further method uses synthetic double stranded oligonucleotide linker and 
adaptors. DNA fragments with blunt ends a*, generated by bacteriophage T4 DNA 
polymerase or Kcoli DNA polymerase I which remove protruding 3' termini and fill 
m recessed 3' ends, Synthetic tinkers and pieces of blunt-ended double-stranded 
DNA, which contain recognition sequences for defined restriction enzymes, can be 
hgated to blunt-ended DNA fragments by T4 DNA ligase. They are subsequently 
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digested with appropriate restriction enzymes to create cohesive ends and ligated to 
an expression vector with compatible termini. Adaptors are also chemically 
synthesised DNA fragments which contain one blunt end used for ligation but which 
also possess one preformed cohesive end Alternatively a DNA fragment or DNA 
5 fragments can be ligated together by the action of DNA ligase in the presence or 
absence of one or mora synthetic double stranded oligonucleotides optionally 
containing cohesive ends. 

Synthetic linkers containing a variety of restriction endonvclease sites are 
10 commercially available from a number of sources including Sigma-Genosys Ltd, 
London Road, Pampisford, Cambridge, United Kingdom. 

Accordingly, the present invention also provides a plasmid obtainable by the 
above method. 

15 

The present invention also provides a host cell comprising , a plasmid as defined 
above. The host cell may be any type of cell. Bacterial and yeast host cells are 

preferred. Bacterial host cells may be useful for cloning purposes. Yeast host 

cells may be useful for expression of genes present in the plasmid. 

20 

In one embodiment the host cell is a cell in which the plasmid is stable as a 
multicopy plasmid. Plasmids obtained Scorn one yeast type can be maintained in 
other yeast types (Irie et al 9 1991, Gene, 108(1), 139-144; Irie et al> 1991, Mol 
Gen. Genet, 225(2), 257-265). For example, pSRl from Zygosacckaromyces 

25 rouxii can be maintained in Saccharomyces cerevisiae. Where the plasmid is 
based on pSRl, pSB3 or pSB4 the host cell may bs Zygosaccharomyces rouxiU 
where the plasmid is based on pSBl or pSB2 the s host cell may be 
Zygosacckaromyces baitii, where the plasmid is based on pPMl the host cell may 
be Pichia membranaefaciens, where the plasmid is based on pSMl the host cell 

30 may be Zygosaccharomyces fermentati^ where the plasmid is based on pKDl the 
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host cell may be Kluyveromyces drosopkUarum and where the plasmid is based on 
the 2pm plasmid the host cell may be Saccharomyces cetevisiae or 
Saccharomyces carlsbergensis. A 2um-family plasmid of the invention can be 
said to be "based on" a naturally occurring plasmid if it comprises one, two or 
preferably three of the genes FLP t JREP1 and REP2 having sequences derived 
from that naturally occurring plasmid, 

A plasmid as defined above, may be introduced into a host through standard 
techniques. With regard to transformation of piokaryohc host cells, see, fo r 
example, Cohen et al (1972) Proc. Natl. Acad. Sci. USA 69, 21 10 and Sambrwk et 
al (2001) Molecular Cloning, A Laboratory Manual, 3 rf Ed. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY. Transformation of yeast cells is described in 
Sherman et al (1926) Methods In Yeast Genetics, A Laboratory Manual, Cold Spring 
Harbor, NY. The method of Beggs (1978) Nature 275, 104-109 is also useful. 
Methods for the transformation of S. cerevisiae are taught! generally in El> 251 
744, EP 258 067 and WO 90/01063, all of which are incorporated herein by 
reference. With regard to vertebrate cells, reagents useful in transfecting such cells, 
for example calcium phosphate and DEAE-dextran or liposome formulations, are 
available from Stratagene Cloning Systems, or Life Technologies Inc., Gaithersburg 
20 MD 20877, USA. 



10 



15 



Eleetroporation is also useful for transforming cells and is well known in the art for 
transforming yeast cell, bacterial cells and vertebrate cells. Methods for 
transformation of yeast by eleetroporation are disclosed M Becker & Guarente 
25 (1990) Methods Enzymol 194, 182. 

Generally, the plasmid will Worm not all of the hosts and it will therefore be 
necessary to select for transformed host cells. Thus, a plasmid according to any one 
of the first, second or third aspects of the present invention may comprise a 
30 selectable marker, either within a polynucleotide sequence insertion, or elsewhere 

42 
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on the plasmid, including but not limited to bacterial selectable marker and/or a 
yeast selectable marker. A typical bacterial selectable marker is the P-lactamase 
gene although many others are known in the art. Typical yeast selectable marker 
include LEU2 (or an equivalent gene encoding a protein with the activity of p- 
5 lactamase malate dehydrogenase), TRP1, HIS3, HTS4, WA3, URA5, SFA1, ADE2, 
METIS, LYS5, LYS2, ILV2, FBA1 and PGK1. Those skilled in the art will 
appreciate that any gene whose chromosomal deletion or inactivation results in an 
inviable host, so called essential genes, can be used as a selective marker if a 
functional gene is provided on the plasmid, as demonstrated for PGK1 in a pgkl 
io yeast strain (Piper and Curran, 1990, Curr. Genet. 17, 119). Suitable essential 
genes can be found within the Stanford Genome Database (SGD), 
htto:://db.yeastgenome.org). 

Additionally, a plasmid according to any one of the first, second or third aspects of 
15 the present invention may comprise more than one selectable marker, either within 
a polynucleotide sequence insertion, or elsewhere on the plasmid. 

Qne selection technique involves incorporating into the expression vector a DNA 
sequence marker, with any necessary control elements, that jcodes for a selectable 
20 trait in the transformed cell. These markers include dihydrofolate reductase, G41 S or 
neomycin resistance for eukaryotic cell culture, and tetracyclin, kanamycin or 
ampicillin (i.e. (3-lactamase) resistance genes for culturing in E.cott and other 
bacteria. Alternatively, the gene for such selectable trait can be on another vector, 
which is used to co-transform the desired host cell. 

25 

Another method of identifying successfully transformed cells* involves growing the 
cells resulting from the introduction of a plasmid of the invention, optionally to allow 
the expression of a recombinant polypeptide (i.e. a polypeptide which is encoded by 
a polynucleotide sequence on the plasmid and is heterologous to the host cell, in the 
30 sense tirat that polypeptide is not naturally produced by the host). Cells can be 
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harvested and lysed and their DNA or RNA content examined fox the presence of the 
recombinant sequence using a method such as that described by Southern (1975) J. 
Mol Biol 98, 503 or Berent et al (1985) Biotech 3, 208, or other methods of DNA 
andRNA analysis common in the art. Alternatively, the presence of a polypeptide in 
the supernatant of a culture of a transformed eel] can be detected using antibodies. 

In addition to directly assaying for the presence of recombinant DNA, successful 
transfoimation can be confirmed by well known irnmunologicai methods when the 
recombinant DNA is capable of directing the expression of the protein. For example, 
cells successfully transformed with an expression vector produce proteins displaying 
appropriate antigenicity. Samples of cells suspected of being transformed are 
harvested and assayed for the protein using suitable antibodies. 

Thus, in addition to the transformed host cells themselves, the present invention also 
is contemplates a culture of those cells, preferably a monoclonal (clonally 
homogeneous) culture, or a culture derived from a monoclonal culture, in a nutrient 
medium. 

I 

Transformed host cells may then be cultured for a sufficient time and under 
20 appropriate conditions known to those skilled in the art; and in view of the 
teachings disclosed herein, to permit the expression of any ORF( s ) in the one or 
more polynucleotide sequence insertions within the plasmid. 



25 



The present invention thus also provides a method for producing a protein 
comprising the steps of (a) providing a plasmid according to the first, second or 
third aspects of the invention as defined above; (b) providing a suitable host cell; 
(o) transforming the host cell with tiie plasmid; and (d) culturing the transformed 
host cell in a culture medium, thereby to produce the protein: 
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Many expression systems are known, including bacteria (for example E. coli and 
Sacrf/wtf subtilis), yeasts, filamentous fungi (for example Aspergillus), plant cells, 
whole plants, animal cells and insect cells. 

i 

i 

5 In one embodiment the preferred host cells are the yeasts in which the plasmid is 
capable of being maintained as a stable multicopy plasmid' Such yeasts include 
Saccharomyces cerevisiae, Kluyveromyces lactis, Pichia pastoris, 
Zygosaccharomyces rowdi, Zygosaccharomyces bailli, Zygos-accharomyces 
fermentati, said Kluyveromyces drosophilarum. 



10 



A plasmid is capable of being maintained as a stable multicopy plasmid in a host, 
if the plasmid contains, or is modified to contain, a selectable (e.g. LEU2) marker, 
and stability, as measured by the loss of the marker, is at least 1%, 2%, 3%, 4%, 
5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 

15 .95%, 96%, 97%, 98%, 99%, 99-9% or substantially 100% .after one, two, three, 
four, five, six, seven, eight, nine ten, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 
35, 40, 45, 50, 60, 70, 80, 90, 100 or more generations. Loss of a marker can be 
assessed as described above, with reference to Cbinery & Hmchliffe (1989, Cwr. 
Genet., 16,21-25). 

20 ' 

It is particularly advantageous to use a yeast deficient intone or more protein 
mannosyl transferases involved in O-glycosylation of proteins, for instance by 
disruption of the gene coding sequence. 

i 

25 Recombinantly expressed proteins can be subject to undesirable post-translational 
modifications by the producing host cell. For example, | the albumin protein 
sequence does not contain any sites for N-linked glycosylafion and has not been 
reported to be modified, in nature, by O-linked glycosylatipn. However, it has 
been found that recombinant human albumin ("rHA' 1 ) produced in a number of 
30 yeast species can be modified by O-linked glycosylating generally involving 
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mannose. The mannosylated albumin is able to bind to the le'ctin Concanavalin A. 
The amount of mannosylated albumin produced by the yeast can be reduced by 
using a yeast strain deficient in one ox more of the PMT genes (WO 94/04687). 
The most convenient way of achieving this is to create a yeast which has a defect 
5 in its genome such that a reduced level of one of the Pmt proteins is produced. For 
example, there may be a deletion, insertion or transposition in the coding sequence 
or the regulatory regions (or in another gene regulating the expression of one of 
the PMT genes) such that little or no Pmt protein is produced. Alternatively, the 
yeast could be transformed to produce an anti-Pmt agent, such as an anti-Pmt 
10 antibody. 

If a yeast other than S. cerevisiae is used, disruption of one or more of the genes 
equivalent to the PMT genes of £ cerevisiae is also beneficial, e.g. in Pichia 
pastoris or Ktuyveromyces lactis. The sequence of PMTI (or any other PMT 
15 gene) isolated from S. cerevisiae may be used for the identification or disruption 
of genes encoding similar enzymatic activities in other fungal species. The 
cloning of me PMTI homologue of Kluyverotnyces lactis is described in WO 
94/04687. 

20 The yeast will advantageously have a deletion of the HSP15.0 and/or YAP3 genes 
as taught respectively in WO 95/33833 and WO 95/23857. 

The present application also provides a method of producing; a protein comprising 
the steps of providing a host cell as defined above, which host cell comprises a 
25 plasmid of the present invention and culturing the host cell in a culture medium 
rnereby to produce the protein. The culture medium may be non-selective or place 
a selective pressure on the stable multicopy maintenance of the plasmid. 



i 
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A method of producing a protein expressed from a plasmid of the invention 
preferably further comprise the step of isolating the thus produced protein from the 

cultured host cell or the culture medium. 1 

i 
j 

5 The thus produced protein may be present intracellularly or, if secreted, in the 
culture medium and/or periplastic space of die host cell. The protein may be 
isolated from the cell and/or culture medium by many methods known in the art. 
For example purification techniques for the recovery of recombinantly expressed 
albumin have been disclosed in: WO 92/04367, removal of matrix-derived dye; 

10 EP 464 590,, removal of yeast-derived colorants; EI^ 319 067, alkaline 
precipitation and subsequent application of the albumin to a lipophilic phase; and 
WO 96/37515, US 5 728 553 md WO 00/44772, which describe complete 
purification processes; all of which are incorporated herein by reference. Proteins 
other than albumin may be purified from the culture medium by any technique that 

is Has been found to be useful for purifying such proteins. 

Such well-known methods include ammonium sulphate or efhanol precipitation, acid 
or solvent extraction, anion or cation exchange chromatography, pbosphocellulose 
chromatography^ hydrophobic interaction chromatography, affinity chromatography, 
20 hydroxylspatite chromatography, lectin chromatography, concentration, dilution, pH 
adjustment, diafiltration, ultrafiltration, high perfbrmance liquid chromatography 
("HPLC"), reverse phase HPLC, conductivity adjustment and the like. 

In one embodiment* any one or wiore of the above mentioned techniques may be 
used to further purifying the thus isolated protein to a commercially acceptable 
level of purity. By commercially acceptable level of purity, we include the 
provision of the protein at a concentration of at least 0,01 gX~\ 0.02 gX" 1 , 0.03 gX~ 
\ 0.04 gX \ 0.05 g.L ^0.06 gX" 4 A07 gJj\ 0.08 gX" 1 , 0.09 g r\ 0.1 gX"\ 0.2 gX w 
03 gJj\ OA S JL'\ 0,5 gX" 1 , 0.6 gX J , 0.7 gX 1 , 0,8 gX' 1 , 0:9 gX A , 1 gl" 1 , 2 gX^ 
3 gX"\ 4 gX -1 , 5 g.V\ 6 gX' 1 , 7 gX\ 8 gX -1 , 9 gX" 1 , 10 gX L \ 15 g.X 1 , 20 gX" 1 , 25 

47 



25 



30 



23/12 2003 12:35 FAS 01159522201 ERIC POTTER CLARKSON I2, 05 3 

• 4» 



gX 1 , 30 g.L'\ 40 g.X'.SO g.V\ 60 gX" 1 , 70 gl" 1 , 70 gX' 1 , 90 gX' 1 , 100 gX" 1 , 150 
gX" 1 , 200 gX'^SO gX" 1 , 300 gX\ 350 g.U\ 400 gX 1 , 500 gX l , 600 gX" 1 , 700 
gX' 1 , 800 gJL" 1 , 900 gX" 1 , 1000 gX"' . or more. 

5 The thus purified protein may be lyophilised. Alternatively it TO ay be formulated 
with a carrier or diluent, and optionally presented in a unit form. 

It is preferred mat the protein is isolated to achieve a pharmaceutically acceptable 
level of purity. A protein has a pharmaceuticaUy acceptable level of purity if it is 
io essentially pyrogen free and can be administered in a pharmaceutically efficacious 
amount without causing medical effects not associated with the activity of the 
protein. 

The resulting protein may be used for any of its known utilities, which, in the case 
15 of albumin, include i.v. admfoistration to patients to treat severe burns, shock and 
blood loss, supplementing culture media, and as an excipient in formulations of 
other proteins. '< 

Although it is possible for a merapeuucally useful desired protein obtained by a 
20 process of the invention to be administered alone, it is preferable to present it as a 
pharmaceutical formulation, together with one or more acceptable carriers or 
diluents. The carrier(s) or diluent^) must be "acceptable" in the sense of being 
compatible with the desired protein and not deleterious to trie recipients thereof. 
Typically, the carriers or diluents will be water or saline which will be sterile and 
25 pyrogen free. 

Optionally the thus formulated protein will be presented in a unit dosage form, 
such as in the form of a tablet, capsule, injectable solution or the like. 



48 



|PQ89Sg0f m®Mm$® 2:49 



23/12 2003 12:35 FAX 01159522201 



ERIC POTTER CLARK SDN 



@054 



10 



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows a plasmid map of the 2pmplasnrid. 
5 Figure 2 shows a plasmid map of pSAC35. 

Figure 3 shows some exemplified FLP insertion sites. 

Figures 4 to 8, 10, 11, 13 to 32 and 36 shows maps of various plasmids. 

Figure 9 shows fee DNA fragment from pDB2429 comaining the PDI1 gene. 

Figure 12 shows some exemplified REF2 insertion sites, 
is Figure 33 shows table 3 as referred to hi Aft Examples. 

Figure 34 shows the sequence of SEQ ID NO: 1 . 
Figure 35 shows the sequence of SEQ ID NO: 2. 

20 

EXAMPLES 

• These example describes the insertion of additional DNA sequences into a number 
of positions, defined by restriction endonuclease sites, within the US-iegion of a 
25 2um-family plasmid, of the type shown in Figure 2 and generally designated 
pSAC35, which includes a p-lactamase gene (for ampiciUin resistance, which is 
lost from the plasmid following transformation into yeast); a LEU2 selectable 
marker and an oligonucleotide linker, the latter two of whicn are inserted into a 
unique 5fcaBI-site within the UL-region of the 2nm-family disintegration vector, 
pSAC3 (see EP 0 286 424). The sites chosen were towards the 3'-ends of the 



30 
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REP2 and FLP coding regions or m the downstream inverted repeat sequences 
Short synthetic DNA linkers were inserted into each site, and the relative 
stabilities of the modified plasmids were compared during growth on non-selective 
media. Preferred sites for DNA insertions were identified: Insertion of larger 
DNA fragments containing "a gene of interest" was demonstrated by inserting a 
DNA fragment containing the PDI1 gene into the JK?/«I-site after REP2. 



10 



EXAMPLE 1 



Insertion of Synthetic J>NA Linker into Xcml^ites in the Small Unique Region 
ofpSAC35 



Sites assessed initially for insertion of additional DNA into the US-region of 
PSAC35, were the X CM I- 5 ites in the 59*bp inverted repeats. One J&ral-site cuts 
15 51-bp after die REP2 translation termination codon, , whereas, the other X^I-site 
cuts 127-bp before the end of the FLP coding sequence, due. to overlap with the 
inverted repeat (see Figure 3). 

The sequence inserted was a 52-bp linker made by annealing 0.5mM solutions of 
20 oligonucleotides CF85 and CF87. This DNA linker contaW a core region 

"Snam-Pacl-FseVSfil-Smal-SnaBI". which encoded restrietion sites absent from 
pSAC35. 



25 



B££X 

-Pad 



SnaBX 



30 WSB1 ^el smax 



PPS7 T ™ TGGTA ^TATEAATT aAGGCcJflcJ AGGCCCGSGT ACGTACC&AT TGA 
CF87 TCCTCACCAT GC&TAATTAA TTCCGGCCGfl TCCGGGCCCA SSJSSJI If 
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Plasmid pSAC35 was partially digested with Xcml, the linear 1 1-kb fragment was 
isolated from a 0.7%(Wv) agarose gel, ligated with the CF86/CF87 Xcml linker 
(neat, 10" 1 and 10" 2 dilutions) and transformed into & coli DH5a. ArnpiciBin 
resistant transformants were selected and screened for the presence of plasmids 
5 that could be linearised by Smal digestion. Restriction enzyme analysis identified 
pDB26BS (Figure 4) with the linker cloned into the Xcml-siie after REP2. DNA 
sequencing using oligonucleotides primers CF88, CF98 and CF99 (Table 1) 
confirmed the insertion contained the correct linker sequence. 

10 Table 1: Oligonucleotide sequencing primers: 



Primer 


Description 


Sequence t 


CFS8 


REP2 primer, 2Qmer 


5'-ATCACGTAATACTTCTAGGG-3' 


CF98 


EEP2 primer, 20mer 


5'-AGAGTGAGTTGGAAGGAAGG-3 ' 


CF99 


REP2 primer, 20mer 


5 '-AGCTCGTAAGCGTCGTTACC-3 ' 


CF90 


FLP primer, 20mer 


5 '-CTAGTTTCTCGGTACT ATGC-3 ' 


CF91 


FLP primer, 20mer 

1 


5 1 -GAGTTGACTAATGTTGTGGG-3 5 
1 


CF100 


FLP primer, 20mer 


5 '-AAAGCITIX3AAGAAAAATGC-3 5 


CF101 


FLP primer, 20mer 


5 '-GCAAGGGGTAGGATCGATCC-3 * 


CF123 


pDB2783 MCS, 
24mer 


5 '-ATTCGAGCTCGGTACCTACGTACT-3 * 

i 
1 




I 
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Primer 


Description 


i 

Sequence 


CF126 


pDB2783 MCS, 
24rner 


5 5 -CCCGOGCACGTGGGATCCTCTAGA-3 7 

r 


Forward 


pDB2783 MCS 7 
17mer 


S'-GTAAAACGACGGCCAQT-3' 


M13- 

Reverse 


pDB27S3 MCS, 
16mer 


5 '-AACAGCTATGACCATG-3 a 



Restriction enzyme analysis also identified P DB2689 (Figure 5), with the linker 
cloned into the Xcml-site in the FLP gene. However, ihe linker in pDB26S9 was 
shown by DNA sequencing using primers CF90 ^ CF91 to ' have a missing G;C 
base-pair within the FseV%il site (marked above in bold in the CF86+CF87 
linker). This generated a coding sequence for a mutant Flp-protein, with 39 C- 
terminal amino acid residues replaced by 56 different amino acids before the 
translation termination codon. 

The missing base-pair in the pDB2689 linker sequence was corrected to produce 
PDB27S6 (Figure 6). To achieve this, a 31-bp S'-phosphorylated o^oBI-linker 
was made from oligonucleotides CF104 and CF105. This was ligated into the 
.S^BI site of pDB2689, which had previously been treated with calf intestinal 
alkaline phosphatase. DNA sequencing with primers CF90, CF91, CFI00 and 
CF101 confirmed the correct DNA linker sequence in P DB2?k This generated a 
coding sequence for a mutant Pip-protein, with 39 C-terminal residues replaced by 
14 different residues before translation termination. 
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SndBI Linker (CF1 (H+CFlOSt 

SfiT. 



FssI 

Pad SmaT 



CF104 Si-GTATTAATTA AGGCCGGCCA GGCCCGGGTA C 
CF10S CATAATTAAT TCOGGCCGG'T CCGGGCCCAT G-Pi 



An additional plasmid, pDB2798 (Figure 7), was also produced by ligation of the 
SnaBl linker in the opposite direction to pPB2786. The linker sequence in 
pDB2798 was confirmed by DNA sequencing. Plasmid pDB279S contained a 
coding sequence for a mutant Flp-protein, witfi 39 C-terminal' residues replaced by 
15 8 different residues before translation termination. 

A linker was also cloned into the XcM-site in the FLP gene to truncate the Flp 
protein at the site of insertion. The linker used was a 45-bp S'-phosphorylated 
JTemJ-linker made from oligonucleotides CF120 and CF121i . 



Xcml Linker fCF120+CF121> 

3f±J 



CPX20 p±-GTAATAATA CGTATTAATT AAGGCCGGCC AGGCCCGGST ACGTAA 

CKL21 TCATTATTAT GCATAATEAA TTCCOOCCGG TCCGGGCCCft, XGCAT-Pa 



This CF120/CF121 Xcml linker was ligated with U-kb ,pSAC35 fragments 
produced by partial digestion with Xcml, followed by treatment with calf intestinal 
alkaline phosphatase. Analysis of anipicillin resistant E. coli DH5a transformants 
identified clones containing pDB2823 (Figure 8). DNA sequencing with primers 
35 CF90, CF91, CF100 and CF101 confirmed the linker sequence in pDB2823. 
Translation termination within the linker inserted would result in the production of 
Flp (1-3 82)^ which lacked 41 C-terminal residues. i 
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The impact on plasmid stability from insertion of linker sequences into the Xcml- 
sites within the US-region of pSAC35 was assessed for pDB2<588 and pDB2<589. 
Plasmid stability was determined in a S. cerevisiae strain by loss of the LEU2 
marker during non-selective grown on YEPS. The same yeast strain, transformed 
with pSAC35, which is structurally similar to pSAC3, but contains additional 
DNA inserted at <he ^cBI site that contained a LEU2 selectable marker (Chinery 
& Hinchliffe, 1989, Curr. Genet., 16, 21), was used as the control. 

The yeast strain was transformed to leueine prototrophy using a modified lithium 
acetate method (Sigma yeast transformation kit, YEAST- 1, protocol 2; (Ito et al, 
1983, J. BacterioL, 153, 163- Elble, 1992, Biotechnigues, 13, 18)). Transformante 
were selected on BMMD-agar plates, and subsequently patched out on 

BMMD-agar plates. Cryopreserved trehalose stocks were prepared from lOmL 
BMMD shake flask cultures (24 his, 30°C, 2Q0rpm), 

The composition of YEPD and BMMD is described by Sleep et aL, 2002, Yeast 
IS, 403. YEPS and BMMS are similar in composition to YEPD and BMMD 
accept that 2% (w/v) sucrose was substituted for the 2% (w/v) glucose as the sole 
initial carbon source. 

For the determination of plasmid stability a ImL cryopreserved stock was thawed 
and inoculated into lOOmL YEPS (initial OD 600 w 0-04-0.09) in a 250.nL conical 
flask and grow for approximately 72 hours (70^74 his) at 30°C in an orbital shaker 
(200 rpm, Innova 4300 incubator shaker, New Brunswick Scientific). 

| 

Samples were removed from each flask, diluted in YEPS-Wh (10" 2 to 1Q J 
dilution), and lOOpX aliquots plated in duplicate onto YEPS-agar plates. Cells 
were grown at 30«C for 3-4 days to allow single colonies to develop. For each 
yeast stock analysed, 100 random colonies were patched in replica onto BMMS- 
agar plates followed by YEPS-agar plates. After growth at 3o| D C for 3-4 days the 
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percentage of colonies growing on both BMMS-agar plates a^d YBPS-agar plates 
was determined as the measure of plasmid stability. 

In the above analysis to measure the loss of the LEU2 marker from transforaiante, 
5 pSAC35 and P DB2688 appeared to be 100% stable, whereas P DB2689 was 72% 
stable. Hence, insertion of the linker into the ^bml-site after RBP2 had no 
apparent effect on plasmid stability, despite altering the transcribed sequence and 
disrupting the homology between Ihe 599^bp inverted repeats. Insertion of the 
linker at the Xcml-sit* in FLP also resulted in a surprisingly stable plasmid, 
10 despite both disruption of tiie inverted repeat and mutation of the Flp protein. 
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EXAMPLE 2 

I 

Insertion of the PDII Gene into the Xcml Linker of pDB2688 

The insertion of a large DNA fragment into the US-region jof 2um-likc vectors 
was demonstrated by cloning the S. cerevisiae PDII gene into the Xc^I-lmker of 
P DB26S8. The PDII gene (Figure 9) was cloned on a 1.9-kb Sacl-Spel fragment 
from a larger S. cerevisiae genomic DNA fragment containing the PDII gene, 
which had been cloned into YIplac211 (Gietz & Sughio, 1?88, Gene, 74, 527- 
534), which had been treated with T4 DNA polymerase to fill the Spel 5'- 
overhang and remove the Sad 3'-overhang. This PDI2 fragment included 212-bp 
of the PDII promoter upstream of the translation initiation, codon, and 148-bp 
downstream of the translation termination codon. This was ligated with Smal 
linearised/calf intestinal alkaline phosphatase treated P DB2688, to create plasmid 
pDB2690 (Figure 10), with the PDII gene transcribed in the same direction as 
EEP2, A S. cerevisiae strain was transformed to leucine prototrophy with 
pDB2690. 
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An expression cassette for a human transferrin mutant (NT413Q, N611Q) was 
subsequently cloned into die JVbtf-site of P DB2$9G to create pDB271 1 (Figure 1 1). 
The expression cassette in pDB271 1 contains the S. cerevisiae PRBJ promoter, an 
HSA/MFcc fusion leader sequence (EP 387319; Sleep et al, 11990, Biotechnology 
s (W.Y.), 8, 42) followed by a coding sequence for the humejn transferrin mutant 
(N413Q, N611Q) and the S. cerevisiae ADffJ terminator., Plasmid pDB2536 
(Figure 36) was constructed similarly by insertion of the same expression cassette 
into the jVofl-site of pSAC35. 

to The advantage of inserting "genes of interest" into the US-region of 2nm-vectors 
was demonstrated by the approximate 7-fold increase in recombinant transferrin 
N413Q, N611Q secretion during fermentation of yeast transformed with 
PDB2711, compared to the same yeast transformed with pDB2536. An 
approximate 15-fold increase in recombinant transferrin N413Q, N611Q secretion 

15 was observed in shake flask culture (data not shown). 

The relative stabilities of plasmids pDB26SS, P DB2690, pDB27ll, pDB2536 and 
PSAC35 were determined in the same yeast strain grown in YEPS media, using 
the method described above (Table 2). 
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Table 2: Summary of plasmid stability date for PDII inserticL into the small 
unique region of pSAC35. Date from 3 days growth in nonselective shake flask 
culture before plating on YEPS-agar. 
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Plasmid 


Insertion 
Site(s) 


Additional Details 


Relative 
Stability 


pSAC35 






100% 


P DB2688 


Xcml ' 


Linker in Inverted 
Repeat 


. 100% 

» 


pDB2690 


Xctnl 


PDII in Xcml Linker 


32% 


pDB2711 


XcmhNotl 


PDII in Xcml Linker rTf 
Cassette at Noil 


1 io% '; 

1 1 

1 1 


1 pDB2536 


NoH 


J rTf Cassette at Notl 


1 | 17% 



In this analysis, pDB2690 was 32% stable, compared to 100% stability for 
P DB268S wilbout the PDII insert. This decrease in plasmid stability was less than 
the decrease in plasmid stability observed witfe pDB2536, due to insertion of the 
rTF (N413Q, N61 1Q) expression cassette into the .tod-site within the large unique 
region of pSAC35 (Table 2). 

Furthermore, selective growth in minimal media during high cell density 
fermentations could overcome the increased plasmid instability due to the PDII 
insertion observed in YEPS medium, as the rTF (N413Q, N611Q) yield from the 
same yeast transformed with P DB2711 did not decrease compared to that achieved 
from the same yeast transformed with pDB2536. 
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EXAMPLE 3 

Insertion ofDNA Linkers into the REP2 Gene and Downstream Sequences in 
the Inverted Repeat of pSAC35 

i 

To- define the useful limits for insertion of additional DNA intp the REP2 gene and 
sequences in the inverted repeat downstream of it, further linkers were inserted 
into pSAC3S. Figure 12 indicates the restriction sites used for these insertions and 
ifce effects on the Rep2 protein of translation termination at these sites. 



10 



The linker inserted at the Xmnl-site in REP2 was a 44-bp sequence made from 
oligonucleotides CF108 and CF109. 



15 



20 



Xmnl Link er TCF1 0S-I-HF1 ftQ) 

Sfll t ... 

PacT 

SnaBI FeeX .Steal 



SnaBT 

i 



2S 



30 



SJm S^™^™ ^TATTAATTA AGGCCSScGA GOecCGGGTA CGTA 

CF109 TATTATTATG CATAATTAAT TCCGGCCGGT CCGGGCCCAT GCAT 

To avoid insertion into other JlW-sites in pSAC3S, the 3, 076-bp JCbdl fragment 
from pSAC35 that contained the REP2 and FLP genes was first sub-cloned into 
the R coli cloning vector pDB26B5 (Figure 13) to produce pDB2783 (Figure 14). 

Plasntid pDB2685 is a pUClfi-like cloniug vector derived from P CF17 containing 
apramycin resistance gene aac(3)IV &om Klebsiella pneumoniae (Rao etal, 1983, 
Antimicrob. Agents Chemother., 24, 689) and multiple cloning site from P MCS5 
(Hoheisel, 1994, Biotechniques, 17, 456). pCF17 was made from pU8600 (Sun et 
at, 1999, Microbiology, 145(9), 2221-7) by digestion with EcoKl, Nhel and the 
Klenow fragment of DNA polymerase I, and self-ligation, followed by isolation 
from the reaction products by transformation of competent E. coli DH5ot cells and 

58 



Si 063 



23 ,12 2003 12:38 FAX 0115952220! ERIC POTTER CI^KSON B ° M 



selection with apramycin sulphate. Plasmid pDB26S5 was constructed by clearing 
a 439bp Sspl-Swal fragment from pMCS5 into pCF17, wbidh had been out with. 
MscI and treated with calf intestinal alkaline phosphatase. Bhie/white selection is 
not dependant on IPTG induction. 

5 

Plasmid pDB27B3 was linearised whhXmnl and ligated with the CF108/CF109 
XmL-Uaba to produce pDB2799 (Figure 15) and pDB2780 (not shown). Plasmid 
pDB2799 contained ihe CF10S/CF109 XmnI linker in the correct orientation for 
translation termination at the insertion site to produce Rep2 (1-244), whereas 
10 pDB2780 contained the linker cloned in the opposite ioricntation DNA 
sequencing with primers CF98 and CF99 confirmed the correct linker sequences. 

The 3,120bp Xbal fragment from pDB2799 was subsequently ligated with a 
7 ,961-bp P SAC35 fragment which had been produced by partial Xbal digestion 
ls and treatment with calf intestinal alkaline phosphatase, to create plasmid P DB2S17 
(B-fbnn) and pDB2818 (A-fonn) disintegration vectors (Figures 16 and 17 
respectively). .' 

Insertion of linkers at the ^al-site in pSAC35 was performed with and without 
20 3'-5' exonuclease digestion by T4 DNA polymerase. This produced coding 
sequences for either Rep2 (1-271) or Rep2 (1-269) before translation termination. 
In the following figure, the sequence GGCC marked with; diagonal lines was 
deleted from the 3'-overhang produced after Apal digestion resulting in removal 
of nucleotides from the codons for Glycine-170 (GGC) and Proline-171. 

25 Thr He Thr Glu Of &6 \ 



ACCATCACT GAGG GCgST. 
TGGTAGTGA CTC ffi^GGGAT 

Apal 



AAGCG 
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i 
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The linker inserted at the ApaL-site without exonuclease digestion was a 50 -bp 
5'-phosphorylated linker made from oligonucleotides CF116 and CFH7. 

Apal-UTikzr f CF1 1 6+CF1 171 

P&e*x Sfi&BX 
StaaBl Fsel 



CPUS Pi-CTTftAT AATACGTATT AATEA&SGCC GGZCA&GCCC GGCTACXSTAiG GGCC 

CF117 CCGGGSATTA TTATGCATAA T2?AATTCCGG COGGTCCGGG- CCCATCcaxC-Pi 



This was ligated with pSAC35, which had been linearised with Apal and treated 
15 with calf intestinal alkaline phosphatase, to produce pDB2788 (Figure 18) and 
pDB2789 (not shown). Within pDB2788 ? the linker was in the correct orientation 
fox translation termination after proIino-271, whereas in pDB27S9 the linker was 
in the opposite orientation. 

20 The linker inserted at the Apat-$\tv with exomiclease digestion by T4 DNA 
polymerase was a 43-bp 5*-phosphorylated linker made from oligonucleotides 
CF106 and CF107, which was called the core termination linker. 



Core Termination-Linker fCF106+CF10T) 

gnaBT FseX Sma.1 j 

CF106 Pi-TAATAATACG TATTAATTAA GGCCGdCGAG GCCCGGGTAC GTA 

ATTATTATGC ATAATTAATT CCGGCCGGTC CG5GCCCATG GAT-Pi 



The core termination linker was ligated with pSAC35., whidh had been linearised 
35 with Apc&> digested with T4 DNA polymerase and treated with calf intestinal 
alkaline phosphatase. This ligation produced pDB2787 (Figure 19) with the linker 
cloned in the correct orientation for translation termination after glutamate-269, 
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The correct DNA sequences were confirmed in all clone? containing the ^pal- 
linkers, using oligonucleotide primers CF98 and CF99- 

The core termination linker (CF106+CF107) was also used for insertion into tbe 
Fspl-sitvz of pDB2783 (Figure 14). The core texminafton linker (CF106+CF107) 
was ligated into pDB2783 linearised by partial Fspl digestion, which had been 
treated with calf intestinal alkaline phosphatase, Plasmlds isolated from 
aprainycin resistant E. coli DH5a txansformants were screened by digestion with 
Fspl} and selected clones were sequenced with M13 forward and reverse primers. 



Plasmid pDB2801 (not shown) was identified containing two copies of the linker 
cloned in the correct orientation (with the Pacl-site nearest 'the REP2 gene). The 
extra copy of the linker was subsequently removed by first deleting a 1 16-bp Nrul- 
Hpal fragment containing an Fsel-site from the multiple cloning site region^ 
15 followed by digestion with Fsel and re-ligation to produce pDB2802 (Figure 20). 
DNA sequencing using oligonucleotide CF126 confiimfed the correct linker 
sequence* 

The 3 3 ll9-bp pDB2802 Xbal fragment was subsequently ligated with a 7,961-bp 
20 pSAC35 fragment produced by partial Xbal digestion and treatment with calf 
intestinal alkaline phosphatase to create pDB2805 (B-form) and pDB2806 (A- 
form) disintegration vectors (Figures 21 and 22, respectively). 



EXAMPLE 4 

25 

Insertion of DNA Linkers into the FLP Gene and Downstream Sequences in the 
Inverted Repeat ofpSACSS 



DNA linkers were inserted into pSAC35 to define the usefol limits for insertion of 
30 additional DNA into the FLP gene and sequences downstream in the inverted 
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repeat Figure 3 indicates the restriction sites used for these insertions and the 
affects on the Flp protein of translation termination at these sites- 

The linker inserted at the ite/I-site was a 49-bp S^phosphorylated linker made 
5 from oligonucleotides CF1 1 8 and CF1 19. ' 

Bell Linker TCF1 1 B+CF1 191 
10 PacX sbaBI 

CF1 18 P?L -GATr^CTAATAftT&CGTATT^^ 
15 CF119 TG^T^TATTATGCATAATTAATTCCGGCCGGTG CG<?<5C GCATGCATCTAG -Pi 

Due to Dam-methylation of the £cil-site in pSAC35, the Bcil-liaket was cloned 
into non-methylated pSAC35 DNA, which had been isolated from the E. coll 
strain ET12567 pUZS002 (MacNeil et at y 1992 s Gene 9 111, 61; Kieser a4 2000, 

20 Practical Streptomyces Genetics, The John Innes Foundation, Norwich). Plasmid 

i 

pSAC35 was linearised with Bcll 7 treated with calf intestinal' alkaline phosphatase, 
and Iigated with the ^c/I-linker to create pDB2Sl6 (Figure 23). DNA sequencing 
with oligonucleotide primers CF91 and CF100 showed that three copies of the 
BcllAmkes; were present in pDB2816 3 which were all in the correct orientation for 
25 translational termination of Flp after histidhie-353. 

Digestion of pDB2816 with Pad followed by self4igation, was performed to 
produce pDB2814 and pDB2815, containing one and two copies of the Bcll-linksr 

respectively (Figures 24 and 25). The DNA sequences of the linkers were 
30 confirmed using primers CF91 and CF100. In S. cerevisiae a truncated Flp (1- 

353) protein will be produced by yeast transformed with pDB2814, pDB2S15 or 

pDB2S16. 
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An additional plasmid pDB2846 (data not shown) was also produced by ligation 
of a single copy of the ite/Hinker in the opposite orientation to pDB2Sl4. This 
has the coding sequence for the first 352-residues from Flp followed by 14 
different residues before translation termination. 

5 

The linker inserted at the ##al-site was a 47-bp 5*-phosphorylated linker made 
ftom oligonucleotides CF1 14 and CF115. 

Jfe^Linker (CF1 14+P F115) 

10 $3£±T 

AnaBI Fael SrnaX 



IS 



CP114 Pi -AlSTPACTATAMACGTA^ 
CF1X5 ATATTATGCMIA^^ 



111© ifgal-linker was Hgated with pDB27S3, which had been linearised by partial 
20 Hgat digestion and treated with calf intestinal alkaline phosphatase to create 
pDB2Sll (Figure 26)- DNA sequencing with oligonucleotides CF90, CF91 and 
CF100 confirmed the correct linker insertion. 



The 3, 123 -bp Xbal fragment from pDB2811 was subsequently ligated with the 
25 7,961 -bp pSAC35 fragment, produced by partial Xbal digestion and treatment 
with calf intestinal alkaline phosphatase to produce pE>B2812 (B-foim) and 
pDB2813 (A-forxn) disintegration vectors containing DNA inserted at Hie Hgal- 
site (Figures 27 and 28, respectively), 

30 Plasmids pDB2803 and pDB2804 (Figures 29 and 30, respectively) with the core 
termination linker (CF106+CF107) inserted at the Fspl after FLP, were isolated by 
the same method used to construct pDB2B0L The correct linker insertions were 
confirmed by DNA sequencing. Flasmid pBB2804 contained the linker inserted 
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in the correct orientation (with the fad-site closest to the| FLP gene), whereas 
pDB2803 contained the linker in the opposite orientation. 

The pDB2S04 3,119-bp Xbal fragment was ligated with the 7,961-bp pSAC35 
5 fragment produced by partial Xbal digestion and treatment with calf intestinal 
alkaline phosphatase to create pDE2S07 (B-form) andj pDB28GS (A-foim) 
disintegration vectors containing DNA inserted at the Fjpl-site after FLP (Figures 
31 and 32 respectively). 

10 EXAMPLES 

Relative Stabilities of the JJEXI2 Marker in Yeast Transformed with 
pSAC35-Lihe Plasmids Containing DNA Linkers Inserted into the Small Unique 
Region and Inverted Repeats 

v \ i 

A S. cerevisiae strain was transformed with the pSAC354ike plasmids containing 
DNA linkers inserted into the US-region and inverted repeats. Cryopreserved 
trehalose stocks were prepared for testing plasmid stabilities (Table 3). Plasmid 
stabilities were analysed as described above for linkers inserted at the Xcral-sites 

20 in pSAC35. Duplicate flasks were set up for each insertion site analysed. In 
addition, to the analysis of colonies derived from cells after 3-days in shake flake 
culture;, colonies were grown and analysed from cells with a further 4-days shake 
flask culture. For this, lOOpL samples were removed from each 3-day old flask 
and sub-cultured in lOOmL YBPS broth for a Jurfher period of approximately 96 

25 hours (94-98 hrs) at 30,0°C in an orbital shaker, after which single colonies were 
obtained and analysed for loss of the LEU2 marker. In this case analysis was 
restricted to a single flask from selected strains, for which 50 colonies were 
picked. The overall results are summarised in Table 4. 
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Table 4 : Summary of plasmid stability data for DHA insertions into pSAC35 
Set 1 represents data frpm 3 days in nonselective shske flask cultijre. 
Set 2 represents data from 7 days in nonselective shake flask culture. 



5 A) JREP2 Insertion Sites 



Plasmid(s) 


Insertion 


Additional 


Relative Stability 




Site 


Details 


Setl 


Set 2 


pSAC35 




Control 


99% 


100% 


pDB2817 &pDB2818 


Xmnl 


REP2 (1-244) 


39% 


16% 


PDB2787 


4?oFT4pol. 


REP2 (1-269) 


4j5% 


0% 


pDB2788 


April 


REP2 (1-271) 


3'3% 


0% 


pDB2688 


Xcml 


Inverted Repeat 


100% 


100% 


pDB2S05&pDB2806 


Fspl 


Inverted Repeat 


100% 


100% 


B) FLP Insertion Sites 


i 




P|asmid(s) 


Insertion 


Additional 


Relative Stability 




Site 


Details 


Setl 


Set 2 


pDB2814 


BelL 


FLP (1-353) 


67% 


64% 


pDB2823 


Xcml 


FLP (1-382) 


64% 


53% 


pDB2812&pDB2813 


Hgal 


Inverted Repeat 


100% 


100% 


pDB2808 


Fspl 


Inverted Repeat 


100% 


100% 
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AH of the modified pSAC35 plasmids were able to transform yeast to leucine 
prototrophy, indicating that despite the additional DNA inserted within the 
fianctionally crowded regions of 2[im DNA, all could replicate and partition in S* 
cerevisiae. This applied to plasmids with 43-52 base-pair linkers inserted at all the 
sites in the 2\im US-region, as well as the larger DNA insertion containing the 
PDI1 gene. 
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For the linker insertion sites., data was reproducible between jboth experiments and 
duplicates. AH sites outside REP2 or FLP open reading frames, but within 
inverted repeats appeared to be 100% stable under the test conditions used. 
Plasmid instability (i-e. plasmid loss) was observed for linkers inserted into sites 
within the REP2 or FLP Open reading frames. The observed plasmid instability of 
REP2 insertions was greater than for FLP insertions. For' the REP2 insertions, 
loss of the LEJJ2 marker continued with the extended growth period in non- 
selective media, whereas there was little difference for the FLP insertions. 

Insertions into the REP2 gene produced Rep2 polypeptides truncated within a 
region known to function in self-association and binding to ltheSTB-locusof2nm 
(Sengupta et al, 2001, J. Bacterial, 183, 2306). 



15 Insertions into the FLP gene resulted in truncated Pip proteins. All the insertion 
sites were after tyrosine-343 in the C-terminal domain. Which is essential for 
correct functioning of the Flp protein (Prasad et al 7 1987 ? iProc. Natl Acad. ScL 
84 5 2189; Chen et al, 1992, Cell, 69, 647; Grainge et al, 2001, Jl Mol 
Biol, 314,717). 

20 

None of the insertions into the inverted repeat regions resulted in plasmid 
instability being detected, except for the insertion into the FLP Xcml-sitt, which 
also truncated the Flp protein product. The insertions at the J^spZ-sites in the 
inverted repeat regions were the closest to the FRT (Flp recognition target) 
25 regions, important for plasmid replication. 



pSAC35-like plasiirids have been constructed with 43-52 base-pair DNA linkers 
inserted into the REP2 open reading frame, or the FLP open reading frame or the 
inverted repeat sequences. In addition, a 1.9-kb DNA fragment containing the 
30 PDI1 gene was inserted into a DNA linker at fee JYcTrtl-site after REP 2. 
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. * 1™ Therefore despite inserting DNA into 
transform yeast to leucine prototrophy. Therefore, p » 

^ <>„ m nisfttnid DNA, the plasmid replication and 
functionally crowded regions of 2pm plasma f 

5 partitioning mechanisms had not been abolished. 
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. . ^ ^ leCtive medium indicated that inserting UNA 
wiftrlcer dmin£ growth m non-selecuve mk*" 

^TinTto iLted ^ tad not *e pUsond. ^ piasnnd 

faj However, **» a seduooon » pWd 

K8<Kng ftantes - potions defined „ ^ tot and second ^ of to 
invjon, to M Placid nevertheless h» • ^ « ** 

15 use in yeast when grown on selective-media. 
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CLAIMS 

I 

s 

A 2(i.m-fainily plasmid comprising a polynucleotide sequence insertion, 
deletion and/or substitution between the first base after the last fractional 
codon of at least one of either a REP2 gene or an FLP gene and the last 
base before the FRT site in an inverted repeat adjacent to said gene. 

The 2|um-family plasmid of Claim 1 wherein, other than the polynucleotide 
sequence insertion, deletion and/or substitution, the FLP gene and/or the 
REP2 gene has the sequence of a FLP gene and/or a REP2 gene, 
respectively, derived from a naturally occurring 2 jam-family plasmid. 

The 2|4ni-family plasmid of Claim 1 wherein the naturally occurring 2jj.m- 
family plasmid is selected from pSRl 3 pSB3 or pSB4 as obtained from 
Zygosaccharomyces rouxii, pSBl or pSB2 both as obtained from 
ZygQs&ccharomyces baitli, pSMl as obtained from Zygosaccharomyces 
fermentctth pKDl as obtained from Kluyveramyces drosopkilarum, pPMl 
as obtained from Pichia membranaefaciens, and the 2jLim plasmid as 
obtained from Saccharomyces cerevisiae. 

The 2p.m-family plasmid of Claim 2 or 3 wherein the sequence of the 
inverted repeat adjacent to said FLP and/or REP2 gene is derived from the 
sequence of the corresponding inverted repeat in the same naturally 
occurring 2 nm-family plasmid as the sequence fiom which the gene is 
derived. 

The 2 j^m-family plasmid of any one of Claims 2 to 4 wherein the naturally 
occurring 2 j^m- family plasmid is the 2 jam plasmid as obtained froio 
Saccharomyces cerevisiae* 
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6- The 2|_im~fainily plasmid of Claim 5 wherein the polynucleotide sequence 
insertion, deletion and/or substitution occurs at a position between the first 
base of codon 59 of the REP2 gene and the last base before the FRT site in 
5 the adjacent inverted repeat. 

7. The 2jim-facoily plasmid of Claim 5 or 6 wherein, other than the 
polynucleotide sequence insertion, deletion andtar substitution, the 
sequence of the REP2 gene and the adjacent inverted jrepeat is as defined by 

10 SEQ ID NO:l or variant thereof. 

8. The 2|Am-family plasmid of any one of Claims 1 to 7 wherein 
polynucleotide sequence insertion, deletion and/or substitution occurs at a 
position between the first base of the inverted repeat and the last base 

15 before the FRT site, 

9- The 2 pm- family plasmid of any one of Claims 1 to 7 wherein the 
polynucleotide sequence insertion, deletion and/or substitution occurs 
between the first base after the end of the REP2 coding sequence and the 
20 last base before the FRT site 9 such as at the first base after the end of the 

REP2 coding sequence, 

10. The 2(om-family plasmid of any one of Claims 1 to 7 wherein; other than 
the polynucleotide sequence insertion, deletion ajnd/or substitution, the 
25 inverted repeat that follows the REP2 coding sequence has a sequence 

derived from the corresponding region of the 2 jam plasmid as obtained 
from Saccharomyces cer&vmae and preferably the polynucleotide sequence 
insertion, deletion and/or substitution occurs at miXcml site or an F&pT site 
within the inverted repeat. 



30 
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11- The 2^rai~family plasmid of Claim 5 wherein the polynucleotide sequence 
insertion* deletion and/or substitution occurs at a position between the first 
base of codcra 344 of the FLP gene and the last base before the FRT site in 
the adjacent inverted repeat. 

5 

12. The 2p,m-f£mily plasmid of Claim 5 or 11 wherein, other than the 
polynucleotide sequence insertion, deletion and/or substitution, the 
sequence of the FLP coding sequence and the adjacent inverted repeat is as 
defined by SEQ ID NO:2 or variant thereof. 

10 

13. The 2(jLm-family plasmid of Claim 11 or 12 wherein the polynucleotide 
sequence insertions deletion and/or substitution occurs at a position between 
the first base of the inverted repeat and the last base before the FRT site. 

15 14. The 2^m-family plasmid of Claim 13 wherein the polynucleotide sequence 
insertion, deletion and/or substitution occurs at a position between the first 
base after the end of the FLP coding sequence and the last base before the 
FRT site. 



20 15. The 2|iun-f3mily plasmid of Claim 14 wherein the polynucleotide sequence 
insertion, deletion and/or substitution occurs at the first base after the end 
of the FLP coding sequence. i 



16. The 2 Jim-family plasmid of any one of Claims 1 1 to 15 wherein, other than 
as the polynucleotide sequence insertion, deletion and/or substitution, the 

inverted repeat that follows the FLP gene has a sequence derived from the 
corresponding region of the 2|xm plasmid as obtained from Saccharomyces 
cerevisiae 4 and preferably the polynucleotide sequence insertion, deletion 
and/or substitution occurs at an Hgal site or an Fspl site within the inverted 
30 repeat* 

70 
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17. The 2jxm-fa»ily plasmid of any one of the preceding claims comprising 
polynucleotide sequence insertions, deletions and/or smbstitations "between 
the first bases after the last functional codons of both of the REP2 gene and 
5 the FLP gene and the last bases before the FRT sites in the inverted repeats 

adjacent to each of said genes, which polynucleotide sequence insertions, 

deletions and/or substitutions can be the same or different 

i 

! 
i 

18- The 2 t um-family plasmid of any preceding claim additionally comprising a 
10 polynucleotide sequence inset1ion a deletion and/or substitution which is not 

at a position as defined in any one of the preceding claims. 

19. The 2jira~fenrily plasmid of Claim 18 wherein the polynucleotide sequence 
insertion, deletion and/or substitution occurs within an untranscribed region 

15 around an ARS sequence. 

i 

20. The 2^m-fan?ily plasmid of any one of the preceding claims wherein the, or 
at least one* polynucleotide sequence insertion, deletion and/or substitution 
is a polynucleotide sequence insertion. 

20 

21 . The 2jom-farnily plasmid of Claim 20 in which, the polynucleotide sequence 
insertion encodes an open reading frame, 

22. The 2 fan-family plasmid of Claim 21 in which the open reading frame 
25 encodes a heterologous protein. 

23. The 2fmi~family plasmid of Claim 22 in which the heterologous protein 

comprises the sequence of a protein involved in protein folding, or which 

has chaperons activity or is involved in the unfolded protein response, 

30 albumin, a monoclonal antibody, an etoposide, a serum protein (such as a 
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blood clotting factor), antistasin, a tick anticoaguladtt peptide 3 transferrin 
lactoferrin, endostatin, angiostatin, collagens, immunoglobulins, Fab' 
fragments, F(ab")2, scAb, scFv, interferons, interleulobris, IL10, ILIl, IL2, 
interferon a species and sub-species, interferon (J species and sub-species, 
interferon y species and sub-species, leptin, CNTF, CNTF Axl s, IL1 -receptor 
antagonist, erythropoietijn (EPO) and BPO mimics, thromhopoietiji (TFQ) 
and TFO mimics, prosaptide, cyanovirin-N, 5-helix, T20 peptide, T1249 
peptide, HIV gp41, HIV gpl20, urokinase, pronrbldnase, tPA, hirudin* 
platelet derived growth factor* parathyroid hormone, proinsnlin, insulin, 
glucagon, ghicagon-like peptides, insulin-like growth factor, calcitonin, 
growth hormone, transforming growth factor (J, tumour necrosis factor, G- 
CSF, GM-CSF, M-C5F, FGF, coagulation factors in both pre and active 
forms, including but not limited to plasminogen, fibrinogen, thrombin, pro- 
thrombin, pro-thrombin, von Willebrand's factor, ai-antitrypsin* 
plasminogen activators, Factor VII S Factor VIII, Factor IX, Factor X and 
Factor XHI, nerve growth factor, LACI, platelet-dferived endothelial cell 
growth factor (PD-ECGF), glucose oxidase, serum cholinesterase, 
aprotinin, amyloid precursor protein, inter-alpha trypsin inhibitor, 
antithrombin HI, apo-lipoprotein species, Protein C, Protein S, or a variant 
or fragment of any of the above. 

4. The 2(L«n-family plasmid of Claim 23 in which tfcle heterologous protein 
comprises the sequence of albumin, a variant or fragment thereof, or a 
fusion protein comprising the sequence of any of these. 

5. The 2f4m-family plasmid of Claim 23 in which the heterologous protein 
comprises the sequence of transferrin, a variant or fragment thereof, or a 
fusion protein comprising the sequence of any of these. 
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26* The 2jam-family plasmid of Claim 23 in which the! heterologous protein 
comprises the sequence of lactoferrin, a variant or fragment thereof or a 
fusion protein comprising the sequence of any of these. 

5 27. The 2jjm-family plasmid of Claim 23 in which the heterologous protein 
comprises the sequence of Fc, a variant or fragment thereof, or a fusion 
protein comprising the sequence of any of these. 



28. The 2 Jim-family plasmid of Claim 23 in which the heterologous protein 
io comprises the sequence of a protein involved in protein folding, or which 

has chaperone activity or is involved in the unfolded protein response taken 
from the list AHA1, CCT2 9 CCT3, CCT4, CCT5, CCT6, CCT7, CCT8, 
GN51, CPR3, CPR6, EPS1, ERQ1 9 JBUG1, FM01 9 HGH1, H$P10 7 HSP12, 
HSP104, HSF26, ffSPSQ, HSP42, HSP6Q, HSP78, HSP32> JEM1, MDJ1, 
15 MDJ2, MPD1, MPJD2, PDI1 9 PFDJ, ABC1, APJ1, ATPlh ATP12, BTT1, 

CDC37, CFR7, HSC82, KAR2 9 LHSh MGE2, MRS II* NOB1, ECM10, 
SSAI, SSA2, SSA3, SSA4, SSCJ, SSE2 > SIL1, SLS1, UBI4, ORMI, ORM2, 
PERI, PTC2, PSE1 and HAC1 or a truncated intronless HACL 

20 29. The 2^im~faTirily plasmid of Claim 23 or 28 in v^hich the chaperone is 
protein disulphide isomerase (PDI). 

30. The 2|im-famiJy plasmid of any one of Claims 22 to 29 in which the 
heterologous protein comprises a secretion leader sequence, 

25 

31. The 2nm-family plasmid of Claim 22 in which the heterologous protein 
comprises the sequence of a bacterial selectable marker and/or a yeast 
selectable marker. 
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32. The 2 jam-family plasmid of Claim 31 in which the bacterial selectable 
marker is a P-lactamase gene and/or the yeast selectable marker is a LEU2 
selectable marker. 

5 33. The 2|rni-family plasmid according to any one of the preceding claims 
which plasmid comprises (i) a heterologous sequence encoding a protein; 
(ii) a heterologous sequence encoding a protein comprising the sequence of 
a protein involved in protein folding, a chaperone or a protein involved in 
the unfolded protein response, preferably protein disulphide isomerase; and 
io (iii) a heterologous sequence encoding a protein comprising the sequence of 

a selectable marker; wherein at least one of the heterologous sequences 
occurs at a position as defined by any one of Claims 1 to 16, 



34. A method of preparing a plasmid as defined by any one of the preceding 
15 claims comprising- 

(a) providing a plasmid comprising the sequence of a REP2 gene and the 
inverted repeat that follows the REP2 gene ? or a PLP gene and the inverted 
repeat that follows the FLP gene ? in each case the inverted repeat 
20 comprising an FRT site; 

(b) providing a polynucleotide sequence and inserting the polynucleotide 
sequence into the plasmid at a position as defined in any one of Claims 1 to 
16; and/or 



25 



(c) deleting some or all of the nucleotide bases at the positions defined in any 
one of Claims 1 to 16; and/or 



(d) substituting some or all of the nucleotide bases at the positions defined in 
30 any one of Claims 1 to 16 with alternative nucleotide bases. 
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35, A plasmid obtainable by the method of Claim 34. 

36. A host cell comprising a plasmid as defined by any one of Claims 1 to 33 
and 35. 

37. A host cell according to Claim 36 which is a yeast cell 

38. A host cell according to Claim 36 or 37 in which the plasmid is stable as a 
multicopy plasmid, 

39. A host cell according to Claim 38 in which the plasmid is based on pSRl 3 
pSB3 or pSB4 and the yeast cell is Zygosaccharomycss rouxii, the plasmid 
is based on pSBl or pSB2 and the yeast cell is Zygosaccharomyces bazlli, 
the plasmid is based on pSMl and the yeast cell fe Zygosaccharomycss 
fermentati, the plasmid is based on pKDl and the yeast cell is 
Kluyveromyces drosophilamm* the plasmid is based on pPMl and the yeast 
cell is Pickia membranaefaciens or the plasmid is based on the 2\xm plasmid 
and the yeast cell is Saccharomyces cereviside or Saccharomyces 
carlsbergensis. 

40. A host cell according to Claim 38 or 39 in which, if the plasmid contains, 
or is modified to contain, a selectable marker then stability, as measured by 
the loss of the marker, is at least 1%, 2% 3%, 4% 3 5%, 10%, 15%, 20%, 
25%, 30%, 40%, 50%, 60%, 70%, 75%, 80% ? 85%, , 90% J 95% ? 96%, 97%, 
98%, 99%, 99.9% or substantially 100% after 5 generations. 



Ah 



A method of producing a protein comprising the steps of - 



30 (a) 



providing a plasmid as defined by any one of Claims 1 to 33 or 35 
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(b) providing a suitable host cell; 1 

i 

(c) transforming the host cell with the plasmid; and 

5 

(d) cultuiing the transformed host cell it* a culture medium; 

(e) thereby to produce the protein. 

10 42. A method of producing a protein comprising the steps of providing a host 
cell as defined by my one of Claims 36 to 40 which host cell comprises a 
plasmid as defined by any one of Claims 1 to 33 or 35 and cultuiing the 
host cell in a culture medium thereby to produce the protein. 

is 43. A method according to Claim 41 or 4% further comprising the step of 
isolating the thus produced protein from the cultured host cell or the culture 
medium. 



44. A method according to Claim 43 further comprising the step of purifying the 
20 thus isolated protein to a commercially acceptable level of purity. 

45. A method according to Claim 44 further comprising the step of formulating 
the thus purified protein with a earner or diluent, and optionally presenting 
the thus formulated protein in a unit form. 

25 

46. A method according to Claim 43 further comprising the step of purifying the 
thus isolated protein to a pharmaceutical^ acceptable level of purity, 



47, A method according to Claim 44 further comprising the step of formulating 

30 the thus purified protein with a pharmaceutical^ acceptable carrier or 

i 



23/12 2003 12:43_FAX_0115a522 201 



ERIC POTTER CIARKSON 



BI0S2 



aitaeot and optionally prating the fhas fcm.ula.ed protein in a unit 
dosage foim. ! 



77 



100:89820 ->23-0ec- Q3AT2 : A9 = ;| 



23/12 2003 12:43 FAX 03,1 59522201 



ERIC POTTER CLARKSDN 



ABSTRACT 

I 

i 

MODIFIED PLASMID AND USE THEREOF 

5 The present invention provides a 2pjn-family plasmid comprising a polynucleotide 
sequence insertion, deletion and/or substitution between the first base after the last 
functional codon of at least one of either a REP2 gene or sa l FLP gene and the last 
base before the FRT site in an inverted repeat adjacent to saii gene. 
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Figure 1 
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Figure 5 
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Figure 6 
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Figure 21 
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Figure 22 
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